Abstract. We introduce the setting of extended metric-topological measure spaces as a general
“Wiener like” framework for optimal transport problems and nonsmooth metric analysis in
infinite dimension.

After a brief review of optimal transport tools for general Radon measures, we discuss the 
notions of the Cheeger energy, of the Radon measures concentrated on absolutely continuous 
curves, and of the induced “dynamic transport distances”. We study their main properties 
and their links with the theory of Dirichlet forms and the Bakry-Émery curvature condition, in
particular concerning the contractivity properties and the EVI formulation of the induced Heat 
semigroup. 
1. Introduction

In the last years many papers have been devoted to the investigation of the connection between 
gradient contractivity, contractivity of transport distances and lower bounds on Ricci curvature 
and to the connection between metric and differentiable structures. In these investigations one 
can take as starting point either a metric measure space $(X, d, m)$ or a Dirichlet form $\mathcal{E}$ in
$L^2(X, m)$. In particular [5] provided key connections between the two viewpoints, proving that
under mild regularity assumptions the distance $d_{\mathcal{E}}$ generated out of the Dirichlet form as in
[13] induces a metric energy (called Cheeger energy in [3], [4]) equal to $\mathcal{E}$, and that Ricci lower
bounds can be equivalently stated either in terms of the Bakry-Émery gradient $K$-contractivity
condition $\mathrm{BE}(K, \infty)$, $K \in \mathbb{R}$,

$$\Gamma(P_t f) \le e^{-2Kt} P_t \Gamma(f) \tag{1.1}$$

(here $P$ is the semigroup induced by $\mathcal{E}$), or in terms of $K$-convexity of the entropy along Wasserstein
geodesics (see also [30, 31] and also [22] for extensions to the case when upper bounds on
the dimension are considered). The crucial link between the two formulations is provided by the
characterization of the semigroup $P$ as the $\mathrm{EVI}_K$-gradient flow (see (1.2) below) of the entropy
in the Wasserstein space.

A typical assumption made in the above-mentioned papers is that the topological/measure 
structure is induced by the distance, and that the distance is finite: for instance, when one 
takes $\mathcal{E}$ as starting point, one assumes that the topology induced by $d_{\mathcal{E}}$ coincides with the initial
topology of the space. However, there exist examples where the topology induced by the natural 
distance is too fine and the distance can be even infinite: the simplest and probably most studied 
and natural example is the so-called Wiener space, i.e. a Gaussian measure space endowed with 
the Cameron-Martin distance. 

The main goal of this and of the forthcoming paper [8] is a deeper investigation of the above-
mentioned problems in extended metric structures, where extended metric spaces are sets $X$
endowed with a symmetric function $d : X \times X \to [0,\infty]$ satisfying the triangle inequality, with
$d(x, y) = 0$ iff $x = y$.
Extended distances arise in a natural way by taking the supremum $\sup_{f \in \mathcal{F}} |f(x) - f(y)|$ along
a set $\mathcal{F}$ of functions which separate the points of $X$ (this is precisely what happens with $d_{\mathcal{E}}$), by
construction of length distances and more generally by action minimization. At this level many
extensions of the classical metric theory, for instance the existence of metric derivatives $|\dot{x}|(t)$
for absolutely continuous curves $x(t)$, are fairly trivial, since $d$ induces equivalence classes in $X$
which are classical metric spaces; on the other hand, already the example of the Wiener space
shows that when we are given a reference measure $m$ on $X$ it is very hard to work with the
quotient structure, and it is much better to consider the space as a whole; also in many cases it
happens that we are given a topology $\tau$ in $X$, coarser than the topology induced by the extended
distance. We axiomatize this richer structure with the concept of extended metric-topological
space $(X, \tau, d)$, characterized by the existence of a family $\mathcal{A}$ of bounded functions which separate
the points of $X$ and generate both the Hausdorff topology $\tau$ and the distance $d(x,y)$, the latter
with the formula $\sup_{f \in \mathcal{A}} |f(x) - f(y)|$. In the sequel we denote by $\mathrm{Lip}_b(X, \tau, d)$ the algebra of
bounded, $d$-Lipschitz and $\tau$-continuous functions, which includes $\mathcal{A}$ and generates $\tau$ as well.
In view of the applications we have in mind in [8], we are not assuming that the family $\mathcal{A}$ is
countable and, correspondingly, we do not need extra assumptions on $\tau$; the complete regularity
of $\tau$ is implied by the $\mathcal{A}$-generating property and it will be sufficient for our purpose, provided
the reference measure $m$ is Radon.

Now we pass to a more detailed description of the content of the paper. Section 2 is devoted
to some measure-theoretic preliminaries, mostly borrowed from the very comprehensive monographs
[14, 39]. In particular we introduce the class of Radon probability measures, denoted by $\mathcal{P}(X)$, and
recall the basic compactness theorem for families of probability measures (see Theorem 2.1).
Then we recall the dual formulation of the optimal transport problem when the marginals are
Radon measures, following [29] (see also [50] for the analysis of optimal transport problems in a
very general setup).
Section 3 contains basic and already well-established results of the metric theory, with their
easy adaptation to the extended setting. The only, but essential, new ingredient is a self-
improvement principle for solutions to the so-called $\mathrm{EVI}_K$ (evolution variational inequality) gradient
flows along a semigroup $S$

$$\frac{d}{dt} \frac{1}{2} d^2(S_t x, y) + \frac{K}{2} d^2(S_t x, y) \le F(y) - F(S_t x) \quad \forall t > 0, \tag{1.2}$$

(for all $y$ at a finite $d$-distance from some $S_t x$) which allows to some extent to pass from (1.2) to
the same inequality for the two (extended) length distances induced by $d$ (the former defined
in (3.8) by $\varepsilon$-chains, the latter defined in (3.9) by the minimization of the length of curves),
see Theorem 3.5 for a precise statement which involves a powerful integral formulation of (1.2).
Moreover in Corollary 3.6 we derive from $\mathrm{EVI}_K$ a discrete convexity property relative to the induced
length distance that, under suitable compactness assumptions, can be improved to convexity along all geodesics.

In Section 4 we introduce metric-topological spaces $(X, \tau, d)$ and we discuss a few preliminary
properties of them, in particular the density of $\mathrm{Lip}_b(X, \tau, d)$ in $L^2(X, m)$ for $m$ Radon,
compactness of measures in the space $X^D$ of paths and lower semicontinuity of the $p$-action,
$p \in (1,\infty)$, in this context. In Section 5 we extend to this setting (a priori neither separable
nor metrizable) basic results relative to the extended distance in $\mathcal{P}(X)$ induced by the quadratic
cost $d^2$. Denoting this distance by $W_d$, we prove compactness and lower semicontinuity
theorems, the implication from $W_d$-convergence to weak convergence and the basic superposition
theorem which extends to a non-Polish setup the recent paper [34]. Thanks to this result,
2-absolutely continuous curves $\mu_t$ in $(\mathcal{P}(X), W_d)$ have a lifting to the space $X^{[0,1]}$, i.e. there
exists $\boldsymbol{\eta} \in \mathcal{P}(X^{[0,1]})$ concentrated on 2-absolutely continuous paths $\eta : [0,1] \to (X, d)$ whose
marginals are $\mu_t$ and satisfying

$$\int |\dot{\eta}(t)|^2 \, d\boldsymbol{\eta}(\eta) = |\dot{\mu}_t|^2 \quad \text{for a.e. } t \in (0,1).$$


In Section 6 we recall the basic construction of the so-called Cheeger energy [17], adapted
to the extended setting $(X, \tau, d)$ with a reference measure $m \in \mathcal{P}(X)$. Following with minor
variants [3] (these variants allow to bypass some measurability issues relative to the asymptotic
Lipschitz constant) we set

$$\mathrm{Ch}(f) := \inf \liminf_{n \to \infty} \int g_n^2 \, dm,$$

where the infimum runs among all sequences $(f_n) \subset \mathrm{Lip}_b(X, \tau, d)$ with $\lim_n \int |f_n - f|^2 \, dm = 0$
and all $m$-measurable functions $g_n \ge \mathrm{Lip}_a(f_n, \cdot)$ $m$-a.e. in $X$ (where $\mathrm{Lip}_a$ is the so-called
asymptotic Lipschitz constant, see (6.1)). Together with the construction of $\mathrm{Ch}$ there is the
construction of a local object, called minimal relaxed slope and denoted by $|Df|_w$, which
provides an integral representation of $\mathrm{Ch}$ via $\mathrm{Ch}(f) = \int |Df|_w^2 \, dm$ when $\mathrm{Ch}(f)$ is finite. As shown
in [17] and [3], many classical properties of Sobolev functions extend to this setting; in addition,
defining $\Delta f$ as the element with minimal norm in the subdifferential $\partial \mathrm{Ch}(f)$, a Heat flow $P$
in $L^2(X, m)$ (linear iff $\mathrm{Ch}$ is quadratic) is well defined by $\frac{d}{dt} P_t f = \Delta P_t f$, according to the
evolution theory for maximal monotone operators in Hilbert spaces.

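The aim of Section 7 is to introduce two more extended distances induced by $\mathrm{Ch}$, in the subset
$\mathcal{P}^a(X) \subset \mathcal{P}(X)$ of measures absolutely continuous w.r.t. $m$. The first one, denoted by $W_{\mathrm{Ch}}$, is
a length distance in $\mathcal{P}^a(X)$ whose definition is inspired by the Benamou-Brenier formula

$$W_{\mathrm{Ch}}^2(\rho_0 m, \rho_1 m) := \inf \Big\{ \int_0^1 \|\rho_t'\|^2 \, dt : \rho_t \in \mathrm{CE}_2(X, \mathrm{Ch}, m) \Big\}.$$

Here $\|\rho_t'\|$ is the least function $c(t)$ in $L^2(0,1)$ satisfying

$$\Big| \int f \rho_s \, dm - \int f \rho_t \, dm \Big| \le \int_s^t c(r) \Big( \int |Df|_w^2 \, \rho_r \, dm \Big)^{1/2} dr \quad \forall f \in \mathrm{Lip}_b(X, \tau, d)$$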
and the property above defines the class of curves in $\mathrm{CE}_2(X, \mathrm{Ch}, m)$. The second one, denoted
by $W_{\mathrm{Ch},*}$, has a dual character and it is defined by

$$W_{\mathrm{Ch},*}^2(\rho_0 m, \rho_1 m) := 2 \sup_{\phi} \int (\phi_1 \rho_1 - \phi_0 \rho_0) \, dm,$$

where the supremum runs along all the “formal” subsolutions to the Hamilton-Jacobi equation
$\partial_t \phi_t + |D\phi_t|_w^2 / 2 = 0$. At this high level of generality, we are only able to prove that
$W_d \le W_{\mathrm{Ch},*} \le W_{\mathrm{Ch}}$; however, when we pass from the global to the infinitesimal behaviour, these
distances reveal much closer connections. Indeed, under the Bakry-Émery gradient contractivity
condition $|DP_t f|_w^2 \le e^{-2Kt} P_t |Df|_w^2$ for some $K \in \mathbb{R}$ one can prove that the length distance
associated to $W_{\mathrm{Ch},*}$ is $W_{\mathrm{Ch}}$ (see Remark 10.10); in addition, along curves $\mu_t = P_t \rho \, m$ with
$\rho \in L^\infty(X, m)$ probability density, these distances are finite and the metric derivatives w.r.t. all
these distances coincide a.e. in $(0,\infty)$ (see Corollary 7.5).

Building on this and refining the analysis made in [3], we prove in the subsequent Section 8
that the metric gradient flows in $\mathcal{P}^a(X)$ of the relative entropy functional

$$\mathrm{Ent}(\rho m) := \int \rho \log \rho \, dm \tag{1.3}$$

w.r.t. the distances $W_d$, $W_{\mathrm{Ch},*}$ and $W_{\mathrm{Ch}}$ coincide with the Heat semigroup $P$ in the $P$-invariant
class of bounded probability densities under a mild lower semicontinuity assumption on $|D^- \mathrm{Ent}|$,
the descending slope of $\mathrm{Ent}$ w.r.t. $W_d$.

Section 9 contains a key stability result for Cheeger’s energies, which deals with the case of
a monotone family of $(\tau \times \tau)$-continuous distances $d_i$ approximating from below $d$, as in the
definition of extended metric-topological space. We prove convergence of the corresponding
gradient flows and the formula

$$\mathrm{Ch} = \Big( \inf_{i \in I} \mathrm{Ch}_i \Big)_*,$$

where $\mathrm{Ch}_i$ is Cheeger’s functional relative to $d_i$ and $(\cdot)_*$ denotes the lower semicontinuous envelope
in $L^2(X, m)$. As a byproduct, we get also convergence of the corresponding $L^2$ Heat flows. In
view of the applications given in Section 12 we include in the convergence result also the case
when the $d_i$ are semidistances, i.e. $d_i(x,y) = 0$ does not imply $x = y$. This inclusion requires an
adaptation of the construction of $\mathrm{Ch}$ to the semimetric setting.

In Section 10 we take the point of view of a strongly local and Markovian Dirichlet form
$\mathcal{E}$ endowed with a carré du champ operator $\Gamma$. In this context the definitions of $W_{\mathrm{Ch}}$ and
$W_{\mathrm{Ch},*}$ can be immediately adapted, formally replacing the minimal relaxed slope $|Df|_w$ with
$\sqrt{\Gamma(f)}$. Denoting by $W_{\mathcal{E}}$ and $W_{\mathcal{E},*}$ the corresponding extended distances, also in this context
the $K$-gradient contractivity condition (1.1) yields that $W_{\mathcal{E}}$ is the length distance associated
to $W_{\mathcal{E},*}$; furthermore, we prove in Theorem 10.14 that (1.1) implies $K$-contractivity of both
squared distances w.r.t. $P$; if $L^2(X, m)$ is separable we prove a partial converse, namely
$K$-contractivity of $W_{\mathcal{E}}$ implies the $K$-gradient contractivity (1.1).

Section 11 provides the $\mathrm{EVI}_K$ property (1.2) of $S = P$ relative to the extended distances
$W_{\mathcal{E},*}$ and $W_{\mathcal{E}}$. The proof first provides a duality estimate involving $W_{\mathcal{E},*}$ and then, using the
self-improvement principle of Section 3 and the relation between $W_{\mathcal{E},*}$ and $W_{\mathcal{E}}$, the final result.
In conjunction with the compactness properties of the sublevels of $\mathrm{Ent}$, the $\mathrm{EVI}_K$ property yields
geodesic convexity, relative to $W_{\mathcal{E}}$, of the sublevels of the entropy. In addition, when $K \ge 0$ also
the sets $\{\rho m : \|\rho\|_\infty \le c\}$ are convex and when $K > 0$ the Logarithmic Sobolev Inequality and
the Talagrand Transport Inequality hold.

In Section 12 we analyze more in detail the connection between the metric and differentiable
structures. More precisely, starting from an extended metric-topological space $(X, \tau, d)$ endowed
with a reference measure $m \in \mathcal{P}(X)$ we can build $\mathrm{Ch}$ and, assuming $\mathrm{Ch}$ to be quadratic, ask
whether $\mathrm{Ch}$ fits into the theory of Dirichlet forms; the answer is affirmative and we prove,
following essentially [4], that $\mathrm{Ch}$ is a strongly local and Markovian Dirichlet form, and that
$|Df|_w^2$ corresponds to $\Gamma(f)$. Conversely, given $\mathcal{E}$ we can easily build an extended metric measure
structure by selecting a family $\mathcal{A} \subset \{f : \Gamma(f) \le 1\}$ of pointwise defined functions which separate
the points of $X$. It is important to understand to what extent these two constructions at the
level of the energies and of the distances are each the inverse of the other, namely

$$\mathcal{E} \to d_{\mathcal{E}} \to \mathrm{Ch}_{d_{\mathcal{E}}} = \mathcal{E}\,? \qquad d \to \mathrm{Ch} \to d_{\mathrm{Ch}} = d\,?$$

We know from [43] that, in general, even for Dirichlet forms and for length distances [42], we
can’t expect that the answer is always affirmative. In order to understand this question we
identify special properties of Cheeger’s energies $\mathrm{Ch}$ and of distances $d_{\mathrm{Ch}}$ associated to them,
when one chooses as $\mathcal{A}$, in the construction of $d_{\mathrm{Ch}}$, the class $\{f \in C_b(X) : |Df|_w \le 1\}$. For
Cheeger’s energies, the special property is the so-called $\tau$-upper regularity, already identified in
[5] and here proved and adapted to the extended setting: according to this property $|Df|_w$ can
be approximated by $\tau$-upper semicontinuous functions bounding the gradients of approximating
functions $f_n$ relative to finite distances, see Definition 12.4 for the precise statement. At the level
of distances, the special property we need is that functions in the class $\{f \in C_b(X) : |Df|_w \le 1\}$
have to be 1-Lipschitz w.r.t. $d$.

We prove that $\mathcal{E} \le \mathrm{Ch}_{d_{\mathcal{E}}}$ and that equality holds iff the choice of $\mathcal{A}$ ensures the $\tau$-upper
regularity of $\mathcal{E}$ and that when this happens several other identifications occur, see Theorem 12.5
for the precise statement; the proof, in part adapted from [5], relies on the identification results
established in the previous sections and particularly on Section 9. Notice that in the “regular”
setting of [5] where $\tau$ coincides with the topology induced by $d_{\mathcal{E}}$, $\tau$-regularity can be also obtained
as a consequence of the Bakry-Émery $\mathrm{BE}(K, \infty)$ condition and a weak Feller property of $P$.

On the other hand, in Theorem 12.8 we prove that $d_{\mathrm{Ch}}$ is always larger than $d$, and that
equality holds if and only if any function in $\{f \in C_b(X) : |Df|_w \le 1\}$ is 1-Lipschitz w.r.t. $d$.

These results are independent of the doubling and Poincaré assumptions considered in [30], [31]
and generalize those of [5] to the extended setup.

Section 13 describes classical examples of extended spaces (Wiener spaces, configuration 
spaces) and shows how they fit in our framework. 

Let us conclude by pointing out potential developments that we plan to investigate, at least 
in part, in [8]. The first one deals with the so-called measurable distances, namely distances 
which are pointwise defined only at the level of subsets of positive m-measure of the space. This 
class of distances appears for instance in [28], in connection with the short time behaviour of the 
heat kernel, and it has been deeply investigated in [48], [49], in particular looking for (extended) 
metric realizations of measurable distances. Another direction comes from Gromov-Milman’s 
theory of concentration, nicely developed in the recent monograph [41]: indeed, the notion of 
pyramid of metric measure spaces investigated in [41] displays some analogy with the monotone 
approximation property of the extended distance used in our axiomatization. In addition, the 
convergence result proved in Section 9 at the level of the Cheeger energies (for the special case of 
the monotone approximation) should be compared with the various convergence results (which 
essentially use, instead, the algebra of bounded 1-Lipschitz functions) developed in [41]. 
Acknowledgements. We thank V. Bogachev and D. Zaev for very useful technical and
bibliographical suggestions.
2. Preliminaries

2.1. Measure-theoretic notation, Radon measures, weak and narrow topology. For
a Hausdorff topological space $(X, \tau)$, we denote by $C_b(X)$ the space of bounded continuous
functions $f : (X, \tau) \to \mathbb{R}$ and by $\mathcal{B}(\tau)$ the Borel $\sigma$-algebra of $\tau$. Throughout this paper

$\mathcal{P}(X)$ denotes the class of Radon probability measures in $X$, (2.1)

i.e. Borel probability measures $\mu$ having the property that

for any $B \in \mathcal{B}(\tau)$ and any $\varepsilon > 0$ there exists a compact set $K \subset B$ with $\mu(B \setminus K) \le \varepsilon$. (2.2)
Notice that the Radon property is in general stronger than the tightness one, for which the inner
approximation (2.2) is required only for $B = X$.

Radon measures have stronger additivity and continuity properties in connection with open
sets and lower semicontinuous functions; in particular we shall use this version of the monotone
convergence theorem (see [14, Lem. 7.2.6])

$$\lim_{i \in I} \int f_i \, d\mu = \int \lim_{i \in I} f_i \, d\mu \tag{2.3}$$

valid for Radon measures $\mu$ and for nondecreasing nets of $\tau$-lower semicontinuous and equibounded
functions $f_i$. By truncation, we can apply the same property to nondecreasing nets of
$\tau$-lower semicontinuous $f_i : X \to [0,\infty]$.

By the very definition of Radon topological space [39, Ch. II, Sect. 3], every Borel measure in 
a Radon space is Radon: such class of spaces includes locally compact spaces with a countable 
base of open sets, Polish, Lusin and Souslin spaces [39, Thm. 9 & 10, p. 122]. In particular
the notation (2.1) is consistent with the standard one adopted e.g. in [3, 2, 45], where Polish or 
second countable locally compact spaces are considered. 

The narrow topology on $\mathcal{P}(X)$ can be defined as the coarsest topology for which all maps

$\mu \mapsto \int h \, d\mu$ from $\mathcal{P}(X)$ into $\mathbb{R}$ (2.4)

are lower semicontinuous as $h : X \to \mathbb{R}$ varies in the set of bounded lower semicontinuous
functions [39, p. 370]. It can be shown [39, p. 371] that it is a Hausdorff topology on $\mathcal{P}(X)$;
when $(X, \tau)$ is completely regular, i.e.

for any closed set $F \subset X$ and any $x_0 \in X \setminus F$
there exists $f \in C_b(X)$ with $f(x_0) > 0$ and $f \equiv 0$ on $F$, (2.5)

the narrow topology coincides with the usual weak one, induced by the duality with $C_b(X)$.
In fact, in a completely regular space every bounded lower semicontinuous function $h$ is the
upper envelope of the directed set $D_h := \{f \in C_b(X) : f \le h\}$, so that (2.3) shows that
$\int h \, d\mu = \sup \{ \int f \, d\mu : f \in D_h \}$.

One of the advantages of using the narrow topology in $\mathcal{P}(X)$ when $(X, \tau)$ is a Hausdorff
topological space is the following sufficient condition for compactness [39, Theorem 3, p. 379]
(in completely regular spaces it is a consequence of Prokhorov theorem).

Theorem 2.1. Let $(X, \tau)$ be a Hausdorff topological space. Assume that a collection $\mathcal{M} \subset \mathcal{P}(X)$
is equi-tight, i.e.

for every $\varepsilon > 0$ there exists a compact set $K_\varepsilon \subset X$ such that $\sup_{\mu \in \mathcal{M}} \mu(X \setminus K_\varepsilon) \le \varepsilon$. (2.6)

Then $\mathcal{M}$ has limit points in the class $\mathcal{P}(X)$ w.r.t. the narrow topology (in particular, the weak
topology induced by $C_b(X)$ when $(X, \tau)$ is completely regular).


2.2. Transport plans, gluing, optimal transport and duality. Let $(X, \tau)$ and $(Y, \sigma)$ be
Hausdorff topological spaces and let $\mu \in \mathcal{P}(X)$. We say that a map $f : X \to Y$ is Lusin
$\mu$-measurable (see e.g. [39, Chap. I, Sect. 5]) if

for every compact set $K \subset X$ and every $\delta > 0$ there exists a compact set
$K_\delta \subset K$ such that $f$ restricted to $K_\delta$ is continuous and $\mu(K \setminus K_\delta) \le \delta$. (2.7)

Notice that since $\mu$ is a Radon measure the approximation property (2.7) holds in fact for every
$K \in \mathcal{B}(\tau)$.

If $f$ is Lusin $\mu$-measurable then it is also Borel $\mu$-measurable (i.e. $f^{-1}(B)$ is $\mu$-measurable
for every $B \in \mathcal{B}(\sigma)$); the converse is known to be true if $(Y, \sigma)$ is separable and metrizable [39,
Thm. 5, p. 26].

If $f$ is Lusin $\mu$-measurable we denote by $f_\sharp \mu \in \mathcal{P}(Y)$ the push-forward of $\mu$ via $f$: it is
the Radon measure defined by $f_\sharp \mu(B) := \mu(f^{-1}(B))$ for every Borel set $B \in \mathcal{B}(\sigma)$ (Lusin’s
$\mu$-measurability of $f$ is assumed in order to guarantee the Radon property of $f_\sharp \mu$).
If $\mu, \nu \in \mathcal{P}(X)$, we will denote by $\Gamma(\mu, \nu)$ the class of admissible transport plans between $\mu$
and $\nu$, i.e. Radon probability measures in $\mathcal{P}(X \times X)$ with marginals $\mu$ and $\nu$ respectively:

$$\Gamma(\mu, \nu) = \Big\{ \pi \in \mathcal{P}(X \times X) : \pi(A \times X) = \mu(A) \text{ for all } A \in \mathcal{B}(\tau),\ \pi(X \times B) = \nu(B) \text{ for all } B \in \mathcal{B}(\tau) \Big\}. \tag{2.8}$$

It is worth noticing that $\Gamma(\mu, \nu)$ is nonempty (since it contains the unique Radon extension to
$\mathcal{B}(X \times X)$ of the product measure $\mu \times \nu$, see [39, p. 73]), convex and compact with respect to
the narrow topology, by Theorem 2.1.

We shall use the following gluing lemma: 


Lemma 2.2 (Gluing lemma). Let $I = \{0, 1, \ldots, N\}$, $N \ge 2$, and let $(X_i)_{i \in I}$ be Hausdorff
topological spaces with $X := \prod_{i \in I} X_i$ and corresponding projections $p^i$. Let $\pi_i$, $i = 1, \ldots, N$, be
Radon measures in $X_{i-1} \times X_i$ satisfying the compatibility conditions

$$\int \phi(y) \, d\pi_i(x,y) = \int \phi(x) \, d\pi_{i+1}(x,y) \quad \forall \phi \in C_b(X_i) \quad \text{for every } i \in \{1, \ldots, N-1\}.$$

Then there exist Radon measures $\pi$ in $X$ such that $(p^{i-1}, p^i)_\sharp \pi = \pi_i$ for every $i \in \{1, \ldots, N\}$.
The same property (with obvious modifications) holds in the case $I = \mathbb{N}$ of a countable set of
indexes.


Proof. When $I$ is finite, the proof is well-known in Polish spaces (see for instance [2, Lem. 5.3.2])
via disintegrations; however an alternative proof via Hahn-Banach theorem and Riesz-Markov-
Kakutani theorem [14, Thm. 7.3.10 and Thm. 7.10.4] is possible in compact Hausdorff spaces,
as indicated in [45, Exer. 7.9] in the case $N = 2$, and then proceeding by induction. By a simple
exhaustion argument the result extends to Radon measures in Hausdorff topological spaces.

In the case $I = \mathbb{N}$ we argue as in the proof of [2, Lem. 5.3.4], applying the general version of
Kolmogorov-Prokhorov theorem given in [39, Thm. 21, p. 74 and its Corollary p. 81]. □


In the class of Radon probability measures in $X$, we consider the optimal transport problem

$$\inf \Big\{ \int_{X \times X} c \, d\pi : \pi \in \Gamma(\mu, \nu) \Big\} \tag{2.9}$$

where $c$ is a bounded cost function defined in $X \times X$. In the following proposition we denote
by $\int_* f \, d\nu$ the inner integral, namely the supremum of $\int g \, d\nu$ among all $\nu$-measurable $g$ with
$g \le f$ pointwise. Even though the natural setting for duality theorems is provided by costs
measurable w.r.t. the product $\sigma$-algebra, we will need to apply the duality theorem with lower
semicontinuous costs $c \ge 0$. In this case the duality theorem still holds, when Radon measures
are involved; this can be seen, for instance, proving via (2.3) the existence of $\tilde{c} \le c$ measurable
w.r.t. $\mathcal{B}(\tau) \times \mathcal{B}(\tau)$ with $\int \tilde{c} \, d\pi = \int c \, d\pi$.

Proposition 2.3 (Duality). Let $c : X \times X \to \mathbb{R}$ be a bounded and either $(\tau \times \tau)$-lower
semicontinuous or $\mathcal{B}(\tau) \times \mathcal{B}(\tau)$-measurable function. For all $\mu, \nu \in \mathcal{P}(X)$ one has

$$\inf \Big\{ \int_{X \times X} c \, d\pi : \pi \in \Gamma(\mu, \nu) \Big\} = \sup \Big\{ \int_* \psi \, d\nu - \int \phi \, d\mu \Big\} \tag{2.10}$$

where the supremum runs in any of the following three classes:

(a) $\phi, \psi$ $\mathcal{B}(\tau)$-measurable with $\psi(y) - \phi(x) \le c(x,y)$ and $\int |\psi| \, d\nu + \int |\phi| \, d\mu < \infty$;

(b) $\phi, \psi$ bounded $\mathcal{B}(\tau)$-measurable with $\psi(y) - \phi(x) \le c(x,y)$;

(c) $(\phi, \psi)$ with $\psi(y) = \phi^c(y) = \inf_{x \in X} \phi(x) + c(x,y)$ and $\phi$ belonging to the class

$$\mathcal{F} := \Big\{ \phi : X \to [0,\infty) : \phi|_K \in C(K) \text{ for some compact } K \subset X,\ \phi|_{X \setminus K} \equiv c \ge \max_K \phi \Big\}. \tag{2.11}$$

Notice that in the cases (a) and (b) one can replace the inner integral $\int_* \psi \, d\nu$ with $\int \psi \, d\nu$ in
(2.10).
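As a sanity check of the duality formula (2.10), the following sketch compares both sides on a two-point space $X = \{0, 1\}$, where every cost is bounded and every measure is Radon. It is only an illustration under stated assumptions: the function names `primal` and `dual`, the sample cost matrices and the requirement that both points carry positive mass are choices made here, not taken from the paper. The dual side uses potentials of the form $\psi = \phi^c$ as in class (c).

```python
# Two-point illustration of (2.10): X = {0, 1}, mu and nu are probability
# vectors with positive entries, c[x][y] is a bounded cost (all hypothetical data).

def primal(mu, nu, c):
    """Minimum of sum c[x][y] * pi[x][y] over plans pi with marginals mu, nu."""
    lo = max(0.0, mu[0] + nu[0] - 1.0)   # admissible range for a = pi[0][0]
    hi = min(mu[0], nu[0])

    def cost(a):
        pi = [[a, mu[0] - a], [nu[0] - a, 1.0 - mu[0] - nu[0] + a]]
        return sum(c[x][y] * pi[x][y] for x in (0, 1) for y in (0, 1))

    # The cost is affine in a, so the minimum is attained at an endpoint.
    return min(cost(lo), cost(hi))


def dual(mu, nu, c):
    """Maximum of sum psi * nu - sum phi * mu with psi the c-transform of phi."""
    best = float("-inf")
    # Adding a constant to phi does not change the objective, so phi = (0, delta).
    # The objective is concave and piecewise affine in delta: test the breakpoints.
    for y in (0, 1):
        delta = c[0][y] - c[1][y]
        phi = (0.0, delta)
        psi = [min(phi[x] + c[x][z] for x in (0, 1)) for z in (0, 1)]
        value = sum(psi[k] * nu[k] for k in (0, 1)) - sum(phi[k] * mu[k] for k in (0, 1))
        best = max(best, value)
    return best
```

For instance, with `mu = (0.7, 0.3)`, `nu = (0.4, 0.6)` and the cost `[[0, 1], [1, 0]]`, both functions return the same value up to rounding, namely the mass that has to be moved between the two points.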
Proof. At this level of generality, the validity of (2.10) with the choice (a) has been proved in
[29, Prop. 1.31, Thm. 2.14] for both classes of costs $c$. (b) still follows by [29, Lem. 1.8]: in
fact one can assume that $\phi$, $\psi$ take values in the interval $[\inf c - \tfrac{1}{2} \sup c, \tfrac{1}{2} \sup c]$. Let us pass
now from (b) to (c). To this aim we can use once more the Radon assumption on $\mu$, which
implies the validity of Lusin’s theorem (see [14, Thm. 7.1.13]) to find $\phi_n \in \mathcal{F}$ equibounded with
$\mu(\{\phi \ne \phi_n\}) \downarrow 0$ and $\phi_n \ge \phi$ pointwise. Since

$$\int_* \phi_n^c \, d\nu \ge \int_* \phi^c \, d\nu \ge \int \psi \, d\nu$$

we conclude. □


In some sections of this paper a reference measure $m \in \mathcal{P}(X)$ will be fixed; we shall denote
by $\mathcal{P}^a(X)$ the subclass of measures $\mu \ll m$ and by $\mathrm{Ent}$ the relative entropy w.r.t. $m$, equal to
$+\infty$ on $\mathcal{P}(X) \setminus \mathcal{P}^a(X)$ and given by

$$\mathrm{Ent}(\rho m) := \int \rho \log \rho \, dm \tag{2.12}$$

otherwise. Since $m$ will be fixed, we do not emphasize it in the notations $\mathcal{P}^a(X)$, $\mathrm{Ent}$ and we
will also use the short notation $\|f\|_p$ for $\|f\|_{L^p(X,m)}$, $p \in [1,\infty]$.


3. Extended metric spaces 


In this section we introduce some basic analytic tools for analysis in metric spaces. Most of 
them extend in a natural way to extended metric spaces, defined below. 

Definition 3.1 (Extended metric spaces). We say that $(X, d)$ is an extended metric space if
$d : X \times X \to [0, \infty]$ is a symmetric function satisfying the triangle inequality, with $d(x,y) = 0$
iff $x = y$.

When the condition $d(x,y) = 0$ iff $x = y$ is weakened to $d(x,x) = 0$ we say that $(X, d)$ is an
(extended) semimetric space.

The main example of extended semimetric space we have in mind arises from the construction

$$d(x,y) := \sup_{f \in \mathcal{A}} |f(x) - f(y)|, \tag{3.1}$$

where $\mathcal{A}$ is any class of bounded functions $f : X \to \mathbb{R}$; it is an extended metric space iff $\mathcal{A}$
separates the points of $X$. Other natural examples arise by the construction of length distances
(derived from distances or from action minimization), which need not be finite.
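The construction (3.1) can be tried out on a finite sample. The sketch below is a toy illustration, not part of the paper: the family of two clipped coordinate functions, the sample points and all names are hypothetical. It shows that a family which fails to separate two points only yields a semimetric, while a separating family yields a metric on the sample.

```python
def sup_distance(family, x, y):
    """d(x, y) = sup over f in the family of |f(x) - f(y)|, as in (3.1)."""
    return max(abs(f(x) - f(y)) for f in family)


def clip(t):
    """Truncate to [-1, 1] so that the functions below are bounded."""
    return max(-1.0, min(1.0, t))


# Hypothetical bounded functions on the plane.
first = lambda p: clip(p[0])
second = lambda p: clip(p[1])

points = [(0.0, 0.0), (0.5, -0.25), (0.5, 0.75), (-1.0, 0.25)]

separating = [first, second]   # separates the sample points
non_separating = [first]       # cannot distinguish points with the same first coordinate
```

With these data `sup_distance(non_separating, points[1], points[2])` vanishes although the two points differ, whereas the separating family assigns them distance `1.0`.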

Since an extended metric space can be seen as the disjoint union of the equivalence classes
induced by the equivalence relation

$$x \sim y \iff d(x, y) < \infty$$

and since the equivalence classes are indeed metric spaces, many results and definitions extend
with no effort to extended metric spaces. For instance, we say that an extended metric space
$(X, d)$ is complete (resp. geodesic, length,...) if all metric spaces $X_{[x]} = \{y : y \sim x\}$ are
complete (resp. geodesic, length,...). In particular any extended metric space has a unique, up
to isometries, (extended) metric completion.


3.1. Absolutely continuous curves and upper gradients. For $D \subset \mathbb{R}$ we denote by $X^D$
the space of maps $\eta : D \to X$ (no continuity is required in general) and by $e_t : X^D \to X$ the
evaluation maps at time $t \in D$, namely $e_t(\eta) := \eta(t)$.

Let $\mathrm{UC}(D; (X, d)) \subset X^D$ be the space of $d$-uniformly continuous maps. For $p \in (1,\infty)$, we
denote by $\mathrm{AC}^p(D; (X, d)) \subset \mathrm{UC}(D; (X, d))$ the subspace of $p$-absolutely continuous maps w.r.t. $d$,
satisfying

$$\mathscr{A}_p(\eta, D) := \sup \Big\{ \sum_{i=0}^{n-1} \frac{d^p(\eta(t_{i+1}), \eta(t_i))}{(t_{i+1} - t_i)^{p-1}} : t_0 < t_1 < \cdots < t_{n-1} < t_n,\ t_i \in D \Big\} < \infty. \tag{3.2}$$

Notice that, thanks to the triangle inequality, the supremum above can also be realized as a
monotone nondecreasing limit in the directed set of partitions of $D$, with the order induced
by the set-theoretic inclusion. When the domain $D$ is clear from the context we simplify the
notation, writing $\mathscr{A}_p(\eta)$. If $(X, d)$ is complete, it is easily seen that any $\eta \in \mathrm{UC}(D; (X, d))$
has a unique $d$-continuous extension to $\overline{D}$, and this extension is still $d$-uniformly continuous; if
$\eta \in \mathrm{AC}^p(D; (X, d))$ then the extension belongs to $\mathrm{AC}^p(\overline{D}; (X, d))$ and the $p$-action $\mathscr{A}_p$ remains
the same. The metric derivative $|\dot{\eta}| : (0, T) \to [0,\infty]$ of $\eta \in \mathrm{AC}^p([0, T]; (X, d))$ is the Borel map
defined by

$$|\dot{\eta}|(t) := \limsup_{s \to t} \frac{d(\eta(s), \eta(t))}{|s - t|}$$

and it can be proved (see for instance [2, Thm. 1.1.2]) that the limsup is a limit a.e. in $(0,T)$,
that $|\dot{\eta}| \in L^p(0,T)$ and that $d(\eta(s), \eta(t)) \le \int_s^t |\dot{\eta}|(r) \, dr$ for all $0 \le s \le t \le T$. Furthermore, $|\dot{\eta}|$
is the smallest $L^1$ function with this property, up to Lebesgue negligible sets, and one can easily
prove that

$$\mathscr{A}_p(\eta) = \int_0^T |\dot{\eta}|^p(t) \, dt \quad \forall \eta \in \mathrm{AC}^p([0,T]; (X, d)). \tag{3.3}$$
We also recall the notion of upper gradient: we say that $g : X \to [0, \infty]$ is an upper gradient of
$f : X \to \mathbb{R}$ (relative to $d$) if $t \mapsto g(\eta(t)) |\dot{\eta}|(t)$ is Lebesgue measurable in $(0,1)$ and

$$|f(\eta(1)) - f(\eta(0))| \le \int_0^1 g(\eta(r)) |\dot{\eta}|(r) \, dr$$

for any $\eta \in \mathrm{AC}^p([0,1]; (X, d))$, $p \in (1,\infty)$ (the dependence on $p$ is harmless, thanks to
reparameterizations).


3.2. Gradient flows. Now we introduce the main concepts of gradient flows used in this paper: 
the first one is based on the energy-dissipation inequality and the second is characterized by a 
family of evolution variational inequalities, see [2] and [18] for much more on this topic. Both 
can be easily generalized to the extended setting and the case of gradient flows in Hilbert spaces, 
detailed in Proposition 3.4, is a very particular and important example. 

Let $F : X \to \mathbb{R} \cup \{+\infty\}$ with (nonempty) domain $D(F) = \{F < \infty\}$. The slope $|DF|$ and
the descending slope $|D^- F|(x)$ of $F$ at $x \in D(F)$ are respectively defined by

$$|DF|(x) := \limsup_{y \to x} \frac{|F(y) - F(x)|}{d(y,x)}, \qquad |D^- F|(x) := \limsup_{y \to x} \frac{[F(y) - F(x)]^-}{d(y,x)}, \tag{3.4}$$

with the convention $|DF|(x) = |D^- F|(x) = 0$ if $x$ is a $d$-isolated point.

We say that a locally absolutely continuous curve $x \in \mathrm{AC}^2_{\mathrm{loc}}((0, \infty); (D(F), d))$ is a metric
gradient curve of $F$ in the energy-dissipation sense if the Energy Dissipation Inequality

$$F(x(t)) + \frac{1}{2} \int_s^t \Big( |\dot{x}|^2(r) + |D^- F|^2(x(r)) \Big) dr \le F(x(s)) \tag{EDI}$$

holds for all $s, t \in (0, \infty)$ with $s \le t$ and also for $s = 0$ when the curve $x$ admits a continuous
extension (still denoted by $x$) to $[0,\infty)$ (recall the definition (3.4) of $|D^- F|$).

Notice that if $|D^- F|$ is an upper gradient of $F$ we can use the inequality

$$F(x(s)) \le F(x(t)) + \int_s^t |D^- F|(x(r)) |\dot{x}(r)| \, dr$$

to obtain that equality holds in (EDI), that $F \circ x$ is locally absolutely continuous in $(0, \infty)$ and
that $|\dot{x}|^2 = |D^- F|^2 \circ x = -(F \circ x)'$ a.e. in $(0, \infty)$. When $x$ has a continuous extension to $[0, \infty)$
and $F(x_0) < \infty$, the absolute continuity holds in all compact intervals of $[0, \infty)$.
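The equality case of (EDI) can be checked on the simplest smooth example. The sketch below assumes the model case $F(x) = x^2/2$ on the real line with the usual distance, whose gradient curve is $x(t) = x_0 e^{-t}$ and whose descending slope is $|x|$; these choices, the Simpson quadrature and all names are assumptions made here for illustration.

```python
import math


def F(x):
    return 0.5 * x * x


def curve(x0, t):
    return x0 * math.exp(-t)             # solves x' = -F'(x) = -x


def dissipation(x0, r):
    speed = abs(x0) * math.exp(-r)       # metric derivative |x'|(r)
    slope = abs(curve(x0, r))            # descending slope |D^- F|(x(r)) = |x(r)|
    return 0.5 * (speed ** 2 + slope ** 2)


def integrate(g, a, b, n=2000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3.0


def edi_gap(x0, s, t):
    """Right-hand side minus left-hand side of (EDI); zero means equality."""
    lhs = F(curve(x0, t)) + integrate(lambda r: dissipation(x0, r), s, t)
    return F(curve(x0, s)) - lhs
```

In this example `edi_gap` vanishes up to quadrature error, in agreement with the identity $|\dot{x}|^2 = |D^- F|^2 \circ x = -(F \circ x)'$.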

For $K \in \mathbb{R}$ we introduce the function $I_K : [0, \infty) \to [0, \infty)$ defined by

$$I_0(t) = t, \qquad I_K(t) := \int_0^t e^{Ks} \, ds = \frac{e^{Kt} - 1}{K} \quad \text{if } K \ne 0.$$
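A direct transcription of $I_K$ is given below as a small sketch; the function name `I` is chosen here for illustration. It uses `math.expm1` to keep accuracy when $Kt$ is small, and it can be used to check the elementary identity $I_{-K}(t) = e^{-Kt} I_K(t)$.

```python
import math


def I(K, t):
    """I_K(t) = integral of exp(K s) over [0, t]; equals t when K = 0."""
    if K == 0:
        return t
    return math.expm1(K * t) / K
```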

In the following definition of $\mathrm{EVI}_K$ gradient flow we consider some lower semicontinuity
conditions that are automatically implied by the local absolute continuity of $x$ when $F$ is lower
semicontinuous and $d$ is a finite distance. Moreover, the specification of the initial condition is
made in such a way that even initial conditions not in $D(F)$, but at a finite distance from $D(F)$,
can be considered.

Definition 3.2 ($\mathrm{EVI}_K$ gradient flows). Let $K \in \mathbb{R}$. We say that $x \in \mathrm{AC}^2_{\mathrm{loc}}((0, \infty); (D(F), d))$ is
an $\mathrm{EVI}_K$ gradient curve of $F$ if $t \mapsto F(x_t)$ is lower semicontinuous in $(0, \infty)$ and for all $y \in D(F)$
satisfying $d(y, x_t) < \infty$ for some (and then all) $t \in (0,\infty)$ one has

$$\frac{d^+}{dt} \frac{1}{2} d^2(x_t, y) + \frac{K}{2} d^2(x_t, y) \le F(y) - F(x_t) \quad \forall t > 0, \tag{$\mathrm{EVI}_K$}$$

where $d^+/dt$ denotes the upper right derivative.

If $\bar{x}$ has finite distance from $D(F)$, we say that $x$ starts from $\bar{x}$ if $\liminf_{t \downarrow 0} F(x_t) \ge F(\bar{x})$ and
$d(x_t, y) \to d(\bar{x}, y)$ as $t \downarrow 0$ for every $y \in D(F)$.

Let us now point out some direct consequences of the definition of $\mathrm{EVI}_K$ gradient curve.
Monotonicity of the energy, uniqueness and contractivity. It is not difficult to show that, for
$\mathrm{EVI}_K$ gradient curves $x$, the map $t \mapsto F(x_t)$ is non-increasing in $(0, \infty)$ and $F(x_t) \to F(\bar{x})$ as
$t \downarrow 0$ if $x$ starts from $\bar{x}$. Moreover one has the contractivity property

$$d(x_{t+s}, y_{t'+s}) \le e^{-Ks} d(x_t, y_{t'}) \quad s, t, t' > 0.$$

By approximation this inequality can be extended to the case when either $t = 0$ or $t' = 0$ (but
not both), for $\mathrm{EVI}_K$ curves starting respectively from $\bar{x}$, $\bar{y}$.

If $t = t' = 0$ the inequality holds in the weaker form

$$d(x_s, y_s) \le e^{-Ks} \inf_{z \in D(F)} \big( d(\bar{x}, z) + d(z, \bar{y}) \big) \quad s > 0,$$

which reduces to the standard one when at least one of the initial points belongs to $D(F)$. In
particular the $\mathrm{EVI}_K$ gradient curve starting from $\bar{x} \in D(F)$ is unique and satisfies $d(x(t), \bar{x}) \to 0$
as $t \downarrow 0$.

Integral version and regularization. Integrating in $(0,t)$ the differential inequality (EVI$_K$) written
in the form $\frac{d^+}{dt}\big(\frac{e^{Kt}}{2}\, d^2(x_t,y)\big) \le e^{Kt}\big(F(y) - F(x_t)\big)$ and using the monotonicity of $F(x_t)$ and
the convergence $d(x_s,y) \to d(\bar x,y)$ as $s \downarrow 0$ we get

$$\frac12 d^2(x_t,y) - \frac{e^{-Kt}}{2}\, d^2(\bar x,y) \le I_{-K}(t)\big(F(y) - F(x_t)\big) \tag{3.5}$$

for every $y \in D(F)$ with $d(\bar x,y) < \infty$, $t > 0$.

In particular we obtain the regularization estimate (see also (3.7) below)

$$F(x_t) \le F(y) + \frac{1}{2 I_K(t)}\, d^2(\bar x,y) \qquad \forall y \in D(F),\ t > 0 . \tag{3.6}$$
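
The two estimates (3.5) and (3.6) can be tested numerically on a model case. The sketch below is an illustration only, under the assumption (not made in the text) that $X = \mathbb{R}$ with the Euclidean distance and $F(x) = \frac K2 x^2$, whose gradient flow is $x_t = \bar x e^{-Kt}$; for this choice (EVI$_K$) holds with equality. All function names are hypothetical.

```python
import math

def I(K, t):
    """I_K(t) = int_0^t exp(K s) ds, with I_0(t) = t."""
    return t if K == 0.0 else math.expm1(K * t) / K

def F(K, x):
    """Model functional F(x) = K x^2 / 2 on the real line (assumption of this sketch)."""
    return 0.5 * K * x * x

def flow(K, x0, t):
    """Gradient flow of F, i.e. the solution x_t = x0 * exp(-K t) of x' = -K x."""
    return x0 * math.exp(-K * t)

def gap_35(K, x0, y, t):
    """Right-hand side minus left-hand side of (3.5); nonnegative when (3.5) holds."""
    xt = flow(K, x0, t)
    lhs = 0.5 * (xt - y) ** 2 - 0.5 * math.exp(-K * t) * (x0 - y) ** 2
    rhs = I(-K, t) * (F(K, y) - F(K, xt))
    return rhs - lhs

def gap_36(K, x0, y, t):
    """Right-hand side minus left-hand side of (3.6); nonnegative when (3.6) holds."""
    xt = flow(K, x0, t)
    return F(K, y) + (x0 - y) ** 2 / (2.0 * I(K, t)) - F(K, xt)
```

Up to rounding, both gaps are nonnegative for every choice of $K$, $\bar x$, $y$ and $t > 0$; when $F$ is constant along the curve (for instance $\bar x = 0$) the gap in (3.5) vanishes, so the integral estimate cannot be improved in general.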

Definition 3.3 (EVI$_K$-semigroup of $F$). Let $D \subset X$ and let $S : D \times [0,\infty) \to D$ be a semigroup.
We say that $S$ is an EVI$_K$ gradient flow of $F$ in $D$ if:

(i) $D \supset D(F)$ and every $x \in D$ has finite distance from $D(F)$;

(ii) for every $x \in D$ one has that $t \mapsto S_t x$ is an EVI$_K$ gradient curve of $F$ starting from $x$;

(iii) $S$ is $K$-contractive in $D$, i.e. $d(S_t x, S_t y) \le e^{-Kt} d(x,y)$ for every $x, y \in D$.

It is not hard to show that if (i) and (ii) hold, then condition (iii) is always satisfied in $D(F)$,
so that it is usually omitted in all the cases when $D = D(F)$, in particular when $D(F)$ is dense
in $X$.

In general existence is much harder to prove and depends on structural properties of (X, d) 
and F. A particularly important case is provided by lower semicontinuous convex functionals in 
Hilbert spaces, that we are now briefly recalling (see for instance [2, Sec. 1.4] for the “metric” 
approach and [16] for the classic formulation via maximal monotone operators). 

If $(X, \|\cdot\|)$ is a Hilbert space, recall that the subdifferential $\partial F(x)$ of a convex functional
$F : X \to (-\infty,\infty]$ at $x \in D(F)$ is the convex closed set (possibly empty)

$$\partial F(x) := \big\{ \xi \in X : F(y) \ge F(x) + \langle \xi, y - x\rangle \ \ \forall y \in X \big\} .$$

Proposition 3.4 (Gradient flows of convex functionals in Hilbert spaces). Assume that $X$ is a
Hilbert space, with distance $d$ induced by the norm, and that $F : X \to [0,\infty]$ is convex and lower
semicontinuous. Then, the following properties hold:

(a) the concepts of metric gradient curve and EVI$_0$ gradient flow starting from $\bar x$ coincide,
for locally absolutely continuous curves $x : [0,\infty) \to X$ with $x_0 = \bar x \in D(F)$;

(b) for all $\bar x \in D(F)$, there exists a unique metric gradient curve $x(t)$ of $F$ starting from $\bar x$.
The map $S_t : \bar x \mapsto x(t)$ defines an EVI$_0$ semigroup of $F$ in $D(F)$;

(c) if $x(t)$ is a metric gradient curve of $F$, then $-x'_+(t) = \lim_{h\downarrow 0} (x(t) - x(t+h))/h$ exists
for all $t > 0$ and it coincides with the element with minimal norm of $\partial F(x(t))$;

(d) if $x(t)$, $y(t)$ are metric gradient curves of $F$, then $d(x(t),y(t)) \le d(x(s),y(s))$, $0 \le s \le t < \infty$;

(e) if $x(t)$ is the metric gradient curve starting from $\bar x$ the following regularization estimates
hold:

$$F(x(t)) \le F(z) + \frac{1}{2t}\, d^2(z,\bar x), \qquad |D^-F|^2(x(t)) \le |D^-F|^2(z) + \frac{1}{t^2}\, d^2(z,\bar x) \tag{3.7}$$

for all $z \in D(F)$ and $t > 0$.
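
Properties (d) and (e) can be illustrated on a nonsmooth one-dimensional example. The following sketch assumes $X = \mathbb{R}$ and $F(x) = |x|$ (a choice made here for illustration, not taken from the text): the gradient flow moves toward the origin at unit speed and then stops, and the descending slope equals $1$ away from the origin and $0$ at the origin, in agreement with the minimal-norm selection in (c). The function names are hypothetical.

```python
import math

def F(x):
    """Convex, lower semicontinuous model functional F(x) = |x| on the real line."""
    return abs(x)

def slope(x):
    """|D^-F|(x): minimal norm in the subdifferential, 1 for x != 0 and 0 at x = 0."""
    return 0.0 if x == 0.0 else 1.0

def flow(x0, t):
    """Gradient flow of |x|: move toward 0 at unit speed, then stay at 0."""
    return math.copysign(max(abs(x0) - t, 0.0), x0)

def regularization_gaps(x0, z, t):
    """Right-hand sides minus left-hand sides of the two estimates in (3.7)."""
    xt = flow(x0, t)
    energy_gap = F(z) + (z - x0) ** 2 / (2.0 * t) - F(xt)
    slope_gap = slope(z) ** 2 + (z - x0) ** 2 / t ** 2 - slope(xt) ** 2
    return energy_gap, slope_gap

def contraction_gap(x0, y0, s, t):
    """d(x(s), y(s)) - d(x(t), y(t)) for s <= t; nonnegative by property (d)."""
    return abs(flow(x0, s) - flow(y0, s)) - abs(flow(x0, t) - flow(y0, t))
```

For instance, with $\bar x = 2$, $z = 0$ and $t$ slightly below $2$ the slope estimate is nearly an equality, since $|D^-F|^2(x(t)) = 1$ while $d^2(z,\bar x)/t^2$ is only slightly larger than $1$.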

3.3. EVI flows, length distances and geodesic convexity. If $(X,d)$ is an extended metric
space, we denote by $d_\ell \ge d$ the extended length distance associated to $d$: since we are not
making any completeness assumption at this level, it can be defined as

$$d_\ell(y,z) := \sup_{\varepsilon>0} d_\varepsilon(y,z) = \lim_{\varepsilon\downarrow 0} d_\varepsilon(y,z), \quad\text{where}\quad
d_\varepsilon(y,z) := \inf\Big\{ \sum_{n=1}^N d(x_n,x_{n-1}) : x_0 = y,\ x_N = z,\ d(x_{n-1},x_n) < \varepsilon \Big\} . \tag{3.8}$$

A second way to generate a length distance from $d$ consists in minimizing the length of all the
absolutely continuous curves connecting two points: we set

$$d_L(y,z) := \inf\Big\{ \int_0^1 |\dot x|(t)\,dt : x \in AC([0,1];(X,d)),\ x_0 = y,\ x_1 = z \Big\} . \tag{3.9}$$

Notice that if $x \in AC([0,1];(X,d))$ is an absolutely continuous curve connecting $y$ to $z$ we easily
get

$$d_\ell(y,z) \le \int_0^1 |\dot x|(t)\,dt, \quad\text{so that}\quad d_\ell \le d_L . \tag{3.10}$$

Motivated by (3.10) we call $d_L$ the upper length distance associated to $d$.
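
The chain construction in (3.8) can be explored on a finite sample. The sketch below is only an illustration under assumptions that are not in the text: the space is the upper unit half-circle with the chordal (Euclidean) distance, chains are restricted to a finite sample of it, and the infimum over chains is computed by Dijkstra's algorithm on the graph joining two sample points when their distance is smaller than $\varepsilon$. Because only sampled points are allowed, the value is an upper bound for $d_\varepsilon$ on the full half-circle. The names `chain_distance`, `chord` and `half_circle` are hypothetical.

```python
import heapq
import math

def chain_distance(points, dist, eps, src, dst):
    """Smallest total length of a chain src -> dst through `points` with all steps < eps.

    Dijkstra on the graph joining two sample points when their distance is < eps.
    Returns math.inf when no such chain exists (the distance is extended).
    """
    n = len(points)
    best = [math.inf] * n
    best[src] = 0.0
    heap = [(0.0, src)]
    while heap:
        d_u, u = heapq.heappop(heap)
        if d_u > best[u]:
            continue  # stale heap entry
        if u == dst:
            return d_u
        for v in range(n):
            if v == u:
                continue
            step = dist(points[u], points[v])
            if step < eps and d_u + step < best[v]:
                best[v] = d_u + step
                heapq.heappush(heap, (best[v], v))
    return best[dst]

def chord(p, q):
    """Euclidean (chordal) distance in the plane."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

# Sample of the upper unit half-circle, from angle 0 to angle pi.
N = 200
half_circle = [(math.cos(math.pi * k / N), math.sin(math.pi * k / N)) for k in range(N + 1)]

coarse = chain_distance(half_circle, chord, 3.0, 0, N)  # a single step of length 2 is allowed
fine = chain_distance(half_circle, chord, 0.1, 0, N)    # short steps force the chain along the arc
```

With the large threshold the chain distance between the two endpoints is the chord length $2$, while with $\varepsilon = 0.1$ it is close to the arc length $\pi$, which is the behaviour one expects from $d_\varepsilon \uparrow d_\ell$. If $\varepsilon$ is smaller than the spacing of the sample, no admissible chain exists and the function returns `math.inf`.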

$(X,d)$ is called a length space if $d_\ell = d$. It is not difficult to check that $d$ is a length distance
if and only if for every $x, y \in X$ with $d(x,y) < \infty$ and all $\varepsilon > 0$ there exists an $\varepsilon$-middle point
$z_\varepsilon$, characterized by

$$d(x,z_\varepsilon) \le \frac12 d(x,y) + \varepsilon, \qquad d(z_\varepsilon,y) \le \frac12 d(x,y) + \varepsilon . \tag{3.11}$$

With this in mind, one can easily check that $d_\ell$ and $d_L$ are length distances. If moreover $(X,d)$
is complete one has $d_\ell = d_L$ (a simple proof can be achieved by selecting, given $x$ and $y$, $\varepsilon/4^i$-midpoints,
$i \ge 0$, recursively; thanks to the completeness, the process converges to an absolutely
continuous curve from $x$ to $y$ with length less than $d_\ell(x,y) + 2\varepsilon$). In the next theorem we describe
a new self-improvement principle for EVI$_K$ gradient flows.

Theorem 3.5 (Self-improvement of EVI$_K$). Let $F : X \to \mathbb{R} \cup \{+\infty\}$ be a proper functional and
let $S$ be an EVI$_K$ gradient flow of $F$ in $X$ relative to $d$. Then $S$ is an EVI$_K$ gradient flow of $F$
relative to $d_\ell$ and $d_L$.

Proof. We discuss the case of $d_\ell$; the proof for $d_L$ is analogous, working with continuous rather
than discrete curves.

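Let us fix $y \in D(F)$, $x \in X$ with $d_\ell(x,y) < \infty$; the existence of $\varepsilon$-middle points as in (3.11)
for arbitrary couples at finite $d_\ell$-distance easily shows that for every $\varepsilon > 0$ and $N \ge 1$ there exist
points $x_i$, $i = 0,\ldots,N$, with $x_0 = y$, $x_N = x$ and $d(x_i,x_{i+1}) \le d_\ell(x,y)\,2^{\varepsilon}/N$, $i = 0,\ldots,N-1$.
Set $s_i := i/N$, $x_i^t := S_{t s_i} x_i$, and $\delta = 1/N$; notice that for $t > 0$ we have $x_i^t \in D(F)$, by our
assumptions on $S$. Using first the contractivity of $S$ and then (3.5) (with $y = x_i^t = S_{t s_i} x_i$) we
obtain

$$\begin{aligned}
\frac12 d^2(x^t_{i+1},x^t_i) - e^{-Kt\delta} e^{-2Kts_i}\,\frac12 d^2(x_{i+1},x_i)
&\le \frac12 d^2(x^t_{i+1},x^t_i) - \frac{e^{-Kt\delta}}{2}\, d^2(S_{ts_i}x_{i+1}, S_{ts_i}x_i) \\
&\le I_{-K}(t\delta)\,\big[F(x^t_i) - F(x^t_{i+1})\big] .
\end{aligned} \tag{3.12}$$

Since $\delta^{-1} = N$ we have

$$\sum_{i=0}^{N-1} \frac1\delta\, d^2(x^t_{i+1},x^t_i) \ge \Big( \sum_{i=0}^{N-1} d(x^t_{i+1},x^t_i) \Big)^2, \qquad
\sum_{i=0}^{N-1} \frac1\delta\, d^2(x^t_{i+1},x^t_i) \ge \frac1\delta \max_{0\le i\le N-1} d^2(x^t_{i+1},x^t_i) \tag{3.13}$$

and

$$\sum_{i=0}^{N-1} \frac1\delta\, e^{-Kt\delta} e^{-2Kts_i}\, d^2(x_{i+1},x_i)
\le 2^{2\varepsilon} e^{-Kt\delta}\, d_\ell^2(x,y) \sum_{i=0}^{N-1} \delta\, e^{-2Kts_i}
\le 2^{2\varepsilon} e^{-Kt\delta}\, d_\ell^2(x,y) \Big( \int_0^1 e^{-2Kts}\,ds + \omega(\delta) \Big) \tag{3.14}$$

with $\omega(\delta) \to 0$ as $\delta \to 0$. Thus, dividing by $\delta$ and summing up (3.12) we obtain

$$\frac12 \sum_{i=0}^{N-1} \frac1\delta\, d^2(x^t_{i+1},x^t_i)
\le \frac{2^{2\varepsilon} e^{-Kt\delta}}{2} \Big( \frac{I_{-2K}(t)}{t} + \omega(\delta) \Big)\, d_\ell^2(x,y)
+ \frac{I_{-K}(t\delta)}{\delta}\, \big( F(y) - F(S_t x) \big) ,$$

where we used $x_0^t = y$, $x_N^t = S_t x$ and $\int_0^1 e^{-2Kts}\,ds = I_{-2K}(t)/t$.

Taking the limit along a family of $\varepsilon \downarrow 0$ and $N \uparrow \infty$, (3.13) and (3.14) show that the liminf of
the left hand side of the last inequality provides an upper bound of $\frac12 d_\ell^2(S_t x,y)$, and this yields

$$\frac12 d_\ell^2(S_t x,y) - \frac{I_{-2K}(t)}{2t}\, d_\ell^2(x,y) \le t\,\big( F(y) - F(S_t x) \big) .$$

This implies the differential inequality at $t = 0$ and then (EVI$_K$) for the extended distance $d_\ell$,
thanks to the semigroup property of $S$. □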


Corollary 3.6 (Approximate geodesic convexity of $F$). Under the same assumptions of the
previous Theorem, let us choose $x, y \in D(F)$ with $d_\ell(x,y) < \infty$, $\varepsilon > 0$, and points $x_0,\ldots,x_N$,
$N \in \mathbb{N}$, corresponding to a uniform partition $s_n = n/N$ of $[0,1]$ such that

$$x_0 = x,\quad x_N = y,\quad d_\ell(x_i,x_{i+1}) \le \frac1N \sqrt{d_\ell^2(x,y) + \varepsilon^2} \qquad 0 \le i \le N-1 . \tag{3.15}$$

Then for all $t > 0$ one has

$$F(S_t x_n) \le (1-s_n) F(x) + s_n F(y) - \frac K2\, s_n(1-s_n)\, d_\ell^2(x,y) + \frac{\varepsilon^2}{2 I_K(t)}\, s_n(1-s_n) . \tag{3.16}$$

In particular, $(D(F),d_\ell)$ and all sublevels of $F$ are length spaces. If moreover the sublevels
$\{F \le c\}$, $c \in \mathbb{R}$, are complete in $(X,d)$, then $d_\ell = d_L$ on $D(F) \times D(F)$.

Proof. The inequality (3.16) can be obtained from the EVI$_K$ property relative to $d_\ell$ (whose
validity is ensured by Theorem 3.5) arguing as in [19, Thm. 3.2]. The equality $d_\ell = d_L$ on the
sublevels and then on $D(F)$ follows by the equality between the two length distances induced
by complete distances. □


An important application of the above result, that we will exploit in Section 11 (specifically,
with $F = \mathrm{Ent}$ and $\tau$ equal to the weak $L^1(X,m)$-topology) concerns the case when the
completeness of the sublevels $\{F \le c\}$ can be improved to compactness with respect to a Hausdorff
topology $\tau$ on $X$ for which $d$ is lower semicontinuous. In this case it is not difficult to prove
that $(D(F),d_\ell)$ is a geodesic space and that (see [18]) the EVI$_K$ property relative to $d_\ell$ yields
$K$-convexity of $F$, i.e. for every $x, y \in D(F)$ there exists a constant speed geodesic $x : [0,1] \to X$
relative to $d_\ell$ connecting $x$ to $y$ such that

$$d_\ell(x_s,x_t) = |t-s|\, d_\ell(x,y), \qquad F(x_t) \le (1-t) F(x) + t F(y) - \frac K2\, t(1-t)\, d_\ell^2(x,y) . \tag{3.17}$$

4. Extended metric-topological spaces 

We axiomatize metric-topological spaces by adding two (somehow competing) compatibility
conditions between the distance $d$ and the topology $\tau$.

Definition 4.1 (Extended metric-topological spaces). Let $(X,d)$ be an extended metric space
and let $\tau$ be a Hausdorff topology in $X$. We say that $(X,\tau,d)$ is an extended metric-topological
space if:

(a) there exists a family of $(\tau\times\tau)$-continuous bounded semidistances $d_i : X \times X \to [0,\infty)$,
$i \in I$, with $d = \sup_i d_i$;

(b) the topology $\tau$ is generated by the family of functions

$$\mathrm{Lip}_b(X,\tau,d) := \{ f : X \to \mathbb{R} : f \text{ is bounded, } d\text{-Lipschitz, } \tau\text{-continuous} \} . \tag{4.1}$$

For every $L > 0$ we will also set

$$\mathrm{Lip}_{b,L}(X,\tau,d) := \{ f \in \mathrm{Lip}_b(X,\tau,d) : |f(x) - f(y)| \le L\, d(x,y) \ \ \forall x, y \in X \} . \tag{4.2}$$

Let us make a few comments on the above definition. 


Boundedness of $d_i$. The boundedness assumption on $d_i$ in (a) is clearly not restrictive, possibly
replacing $d_i$ by $d_i \wedge n$ and enlarging the index set.

Directed families of distances. Possibly passing from the index set $I$ to the collection of its finite
subsets we can assume with no loss of generality that $I$ is a directed set. We shall often make
this assumption in the sequel.

Extended metric-topological structures generated by separating family of functions. If we don't
take the extended metric structure as a starting point, a definition easily seen to be equivalent
can be given starting from a class $A$ of functions which separate the points of $X$; then $\tau$ is defined
as the topology generated by $A$ and the extended distance $d(x,y)$ can be obtained by taking the
supremum of $|f(x) - f(y)|$ as $f$ runs in $A$, as in (3.1). Notice that in this case we can take as
topology $\tau$ the coarsest topology that makes all functions in $A$ continuous, which is easily seen
to be Hausdorff; since $\mathrm{Lip}_b(X,\tau,d)$ contains $A$ by construction, it turns out that condition (a)
above is satisfied and condition (b) is satisfied as well, with $d_i(x,y) = |f_i(x) - f_i(y)|$, $A = \{f_i\}_{i\in I}$.
In addition, if the functions in $A$ are already continuous for some preexisting topology $\tau_0$ in $X$,
this construction provides a topology $\tau$ coarser than $\tau_0$.

Relations between $\tau$ and $d$. Notice that condition (a) yields

$$d \text{ is } (\tau\times\tau)\text{-lower semicontinuous in } X \times X \tag{4.3}$$

and for every net $(x_j)_{j\in J}$ and every $x \in X$

$$d(x_j,x) \to 0 \quad\text{implies}\quad x_j \to x \ \text{ w.r.t. the topology } \tau . \tag{4.4}$$

Indeed, one can use assumption (b) and observe that $f(x_j) \to f(x)$ for all $f \in \mathrm{Lip}_b(X,\tau,d)$.

Lipschitz functions generating $\tau$. We might equivalently express condition (b) by assuming that
there exists a smaller family $F \subset \mathrm{Lip}_b(X,\tau,d)$ which generates the topology $\tau$, i.e. for every net
$(x_j)_{j\in J}$ in $X$ and every $x \in X$

$$f(x_j) \to f(x) \ \text{ for all } f \in F \quad\text{implies}\quad x_j \to x \ \text{ w.r.t. the topology } \tau . \tag{4.5}$$

In fact, by suitably modifying the set of distances $d_i$ without changing $\tau$ and $d$, we can obtain
an equivalent characterization of extended metric-topological spaces:

Lemma 4.2. $(X,\tau,d)$ is an extended metric-topological space according to Definition 4.1 if and
only if $(X,d)$ is an extended metric space and there exists a family of $(\tau\times\tau)$-continuous and
bounded semidistances $d_i$, $i \in I$, such that $d = \sup_{i\in I} d_i$ and for every net $(x_j)_{j\in J}$ in $X$ and
$x \in X$

$$\lim_{j\in J} x_j = x \ \text{ w.r.t. } \tau \quad\Longleftrightarrow\quad \lim_{j\in J} d_i(x_j,x) = 0 \ \text{ for every } i \in I . \tag{4.6}$$

Proof. If $(X,\tau,d)$ is an extended metric-topological space according to Definition 4.1 we can
simply consider the new collection of semidistances of the form

$$d_f(x,y) := |f(x) - f(y)|, \qquad f \in F = \mathrm{Lip}_{b,1}(X,\tau,d) . \tag{4.7}$$

Conversely, if a family of $(\tau\times\tau)$-continuous and bounded semidistances $d_i$, $i \in I$, satisfies
$d = \sup_{i\in I} d_i$ and (4.6), then it is easy to check that $F := \{ d_i(\cdot,z) : i \in I,\ z \in X \}$ is contained
in $\mathrm{Lip}_b(X,\tau,d)$ and generates the topology $\tau$. □

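Approximation of continuous functions. Every $f \in C_b(X)$ admits the useful representation
formula

$$f(x) = \sup_{g \in L^-(f)} g(x) \ \text{ with } L^-(f) := \{ g \in \mathrm{Lip}_b(X,\tau,d) : g \le f \}, \qquad
f(x) = \inf_{h \in L^+(f)} h(x) \ \text{ with } L^+(f) := \{ h \in \mathrm{Lip}_b(X,\tau,d) : h \ge f \}, \tag{4.8}$$

which can be proved by passing to the limit with respect to $(i,n) \in I \times \mathbb{N}$ in the inf and sup
regularizations of $f$

$$g_{i,n}(x) := \inf_{y\in X} f(y) + n\, d_i(x,y), \qquad h_{i,n}(x) := \sup_{y\in X} f(y) - n\, d_i(x,y) \tag{4.9}$$

associated to any family of $(\tau\times\tau)$-continuous distances $d_i$ satisfying (4.6).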

Properties (4.3) and (4.4) show that Definition 4.1 is consistent with the axiomatization of
extended metric spaces proposed in [3]. Notice however that we assume neither that $\tau$ is Polish
(by working directly with Radon measures) nor that $(X,d)$ is complete.

Completion. Thanks to (4.1), extended metric-topological spaces behave well w.r.t. completion:
denoting by $(\bar X,\bar d)$ the abstract completion of $(X,d)$, every function $f$ in $\mathrm{Lip}_b(X,\tau,d)$ admits
a unique Lipschitz extension $\bar f$ to $\bar X$ and we can thus introduce the topology $\bar\tau$ generated by
$\{\bar f : f \in \mathrm{Lip}_b(X,\tau,d)\}$. It is not difficult to check that $(\bar X,\bar\tau,\bar d)$ is an extended metric-topological
space and that $\tau$ is the restriction of $\bar\tau$ to $X$.

A canonical compactification. Consider a set $F$ and the space $\mathbb{X} := \mathbb{R}^F$ endowed with the
topology $\tau_{\mathbb{X}}$ of pointwise convergence. On $\mathbb{X}$ we consider the extended distance

$$d_{\mathbb{X}}(x,y) := \sup_{f\in F} |x_f - y_f| \quad\text{with}\quad x = (x_f)_{f\in F},\ y = (y_f)_{f\in F} \in \mathbb{X} .$$

$(\mathbb{X},\tau_{\mathbb{X}},d_{\mathbb{X}})$ provides a natural class of examples of extended metric-topological spaces depending
on the index set $F$. Extended metric-topological spaces can always be embedded in a compact
subset of some space $\mathbb{X} = \mathbb{R}^F$, preserving the metric-topological structure. As index set $F$ and
embedding $\iota$ we can always choose

$$F = \mathrm{Lip}_{b,1}(X,\tau,d), \qquad \iota : X \to \mathbb{R}^F, \qquad \iota(x)_f := f(x), \quad f \in F, \tag{4.10}$$

as is typical for the Stone-Čech compactification of completely regular spaces. The proof of the
following result is standard.

Lemma 4.3. If $(X,\tau,d)$ is an extended metric-topological space according to Definition 4.1, then
the map $\iota$ in (4.10) is a homeomorphism of $X$ with $\iota(X) \subset \mathbb{X} = \mathbb{R}^F$, it is an isometry
from $(X,d)$ to $(\mathbb{X},d_{\mathbb{X}})$, and $\iota(X)$ has a $\tau_{\mathbb{X}}$-compact closure in $\mathbb{X}$.

One more important consequence, from the measure-theoretic point of view, is the following
result concerning the complete regularity, according to (2.5), of metric-topological spaces.

Lemma 4.4 (Complete regularity). Any extended metric-topological space $(X,\tau,d)$ is completely
regular.

Proof. Let $x_0 \in U \in \tau$ with $U \cap F = \emptyset$. By condition (b), we can assume that $U$ has the form
$\bigcap_j \{f_j > 0\}$ for some finite family of functions $f_j \in \mathrm{Lip}_b(X,\tau,d)$. Then the function $g = \min_j f_j^+$
is null on $F$ and $g(x_0) > 0$. □

A first consequence of the $(\tau\times\tau)$-lower semicontinuity of $d$, technically relevant for us, is the
$\tau$-lower semicontinuity of the function

$$d_K(x) := \min_{y\in K} d(x,y) \tag{4.11}$$

for any compact set $K \subset X$. The function $d_K$, whose $0$ level set is $K$, provides in our context an
analogue of the perfect regularity condition in Topology. Notice also that $d_K$ is the monotone
limit of the $\tau$-continuous functions

$$d_K^i(x) := \min_{y\in K} d_i(x,y) . \tag{4.12}$$


Let us now introduce some additional concepts where the topological and the metric structure
interact. We denote by $\mathcal{B}(\tau,d)$ the $\sigma$-algebra generated by $\mathrm{Lip}_b(X,\tau,d)$; obviously $\mathcal{B}(\tau,d) \subset
\mathcal{B}(\tau)$, but a kind of converse is provided by the following lemma.

Lemma 4.5. In an extended metric-topological space $(X,\tau,d)$, any $\mu \in \mathcal{P}(X)$ is uniquely
determined by its values on $\mathcal{B}(\tau,d)$. In addition $\bigcup_i \mathrm{Lip}_b(X,\tau,d_i)$ separates points of $X$ and it is
dense in $L^2(X,m)$. In particular $\mathrm{Lip}_b(X,\tau,d)$ separates points of $X$ and it is dense in $L^2(X,m)$.

Proof. For the first statement it is sufficient to remark that the sets $\{d_K^i > 0\}$ are $\tau$-open, belong
to $\mathcal{B}(\tau,d)$ and monotonically converge to $\{d_K > 0\} = X \setminus K$. Then the Radon property of $\mu$
ensures that $\mu(X \setminus K) = \lim_i \mu(\{d_K^i > 0\})$. Since $\{d(\cdot,x)\}_{x\in X}$ separates points of $X$, the family
$\{d_i(\cdot,x)\}_{x\in X,\, i\in I}$ is contained in $\mathrm{Lip}_b(X,\tau,d)$ and separates points as well. To prove the density,
it is sufficient to show the implication

$$\int f \varphi \, dm = 0 \quad \forall \varphi \in \bigcup_{i\in I} \mathrm{Lip}_b(X,\tau,d_i) \qquad\Longrightarrow\qquad f = 0 .$$

Clearly $\int f\,dm = 0$ and, arguing by contradiction, it is not restrictive to assume $\int |f|\,dm = 2$.
Splitting $f$ in positive and negative part we can consider $\mu^\pm := f^\pm m \in \mathcal{P}(X)$ to obtain that
$\mu^+ = \mu^-$, which is a contradiction. □

On $X^D$ we put the product topology $\tau^{\otimes D}$, i.e. the coarsest topology making all $e_t$ continuous.
We denote for simplicity this topology by $\tau^*$. Notice that $(X^D,\tau^*)$ is Hausdorff, because $\{e_t\}_{t\in D}$
separates points of $X^D$; if $(X,\tau)$ is completely regular then $(X^D,\tau^*)$ is completely regular as
well, since it is a product of completely regular spaces [35, Thm. 33.2].

By continuity, the push-forward operator induced by $e_t$ maps Radon measures in $X^D$ to Radon
measures in $X$. Let us now check that the class $UC(D;(X,d))$ of $d$-uniformly continuous paths
$\eta : D \to X$ is a Borel subset of $X^D$. Indeed, it is easily seen that its complement is described by

$$\bigcup_{k=1}^\infty \bigcap_{l=1}^\infty \bigcup_{i\in I} \Big\{ \eta : \exists\, s, t \in D \text{ with } |s-t| < \frac1l,\ d_i(\eta(s),\eta(t)) > \frac1k \Big\} .$$

On the other hand, for $k$, $l$ and $i$ fixed, the complement of the set above is

$$\bigcap_{s,t\in D,\ |s-t|<1/l} \Big\{ \eta : d_i(\eta(s),\eta(t)) \le \frac1k \Big\}$$

and therefore is closed in $X^D$ (by the continuity of $(e_s,e_t) : X^D \to X \times X$).

It is easy to check that $d \circ (e_s,e_t) = \sup_i d_i \circ (e_s,e_t)$ is $\tau^*$-lower semicontinuous in $X^D$ for all
$s, t \in D$. This can be used to prove that $\mathcal{A}_p$ is $\tau^*$-lower semicontinuous in $X^D$ and, in particular,
that $AC^p(D;(X,d))$ is a Borel subset of $X^D$, more precisely a countable union of closed sets.

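Theorem 4.6 (Compactness of probabilities in $X^D$). Assume that $p \in (1,\infty)$, $T > 0$ and
$D \subset \mathbb{R}$ is countable. Let $(\boldsymbol\eta_i)_{i\in I}$ be a net of Radon probability measures in $(X^D,\tau^*)$ such that
for all $t \in D$ the family $\{(e_t)_\sharp \boldsymbol\eta_i\}_{i\in I}$ is equi-tight in $\mathcal{P}(X)$.

Then $(\boldsymbol\eta_i)_{i\in I}$ has limit points $\boldsymbol\eta \in \mathcal{P}(X^D)$ in the weak topology induced by $C_b(X^D)$ and any
such limit point, along a subnet $\beta : L \to I$, satisfies:

$$\int \mathcal{A}_p(\eta,D')\, d\boldsymbol\eta(\eta) \le \liminf_{l\in L} \int \mathcal{A}_p(\eta,D')\, d\boldsymbol\eta_{\beta(l)}(\eta) \qquad \forall D' \subset D . \tag{4.13}$$

Proof. The family $\{\boldsymbol\eta_i\}_{i\in I}$ is equi-tight: indeed, enumerating by $\{d_k\}_{k\in\mathbb{N}}$ the elements of $D$, it
suffices to find compact sets $K_{k,n}$ such that $\sup_i (e_{d_k})_\sharp \boldsymbol\eta_i (X \setminus K_{k,n}) < 2^{-k-n}$ and to consider the
sets

$$\Gamma_n := \bigcap_{k=0}^\infty \{ \eta \in X^D : \eta(d_k) \in K_{k,n} \}$$

which are compact in $X^D$ and satisfy $\sup_i \boldsymbol\eta_i(X^D \setminus \Gamma_n) < 2^{1-n}$. It follows that we can apply
Theorem 2.1.

In order to prove (4.13) for a limit point $\boldsymbol\eta \in \mathcal{P}(X^D)$ we use the continuity of $\eta \mapsto d_i(\eta(s),\eta(t))$
and the Radon property of $\boldsymbol\eta$ to get

$$\int d^p(\eta(s),\eta(t))\, d\boldsymbol\eta(\eta) \le \liminf_{l\in L} \int d^p(\eta(s),\eta(t))\, d\boldsymbol\eta_{\beta(l)}(\eta) .$$

Then, given $D' \subset D$, from the superadditivity of liminf we obtain, for all choices of $t_0,\ldots,t_n \in D'$,

$$\int \sum_{j=0}^{n-1} \frac{d^p(\eta(t_{j+1}),\eta(t_j))}{(t_{j+1}-t_j)^{p-1}}\, d\boldsymbol\eta(\eta)
\le \liminf_{l\in L} \int \sum_{j=0}^{n-1} \frac{d^p(\eta(t_{j+1}),\eta(t_j))}{(t_{j+1}-t_j)^{p-1}}\, d\boldsymbol\eta_{\beta(l)}(\eta)
\le \liminf_{l\in L} \int \mathcal{A}_p(\eta,D')\, d\boldsymbol\eta_{\beta(l)}(\eta) .$$

Since the sums $\sum_j d^p(\eta(t_{j+1}),\eta(t_j))/(t_{j+1}-t_j)^{p-1}$ become larger as the partition gets finer, we
can use once more the fact that $\boldsymbol\eta$ is Radon to conclude. □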


Finally, we can add a measure structure in extended metric-topological spaces as follows. 


Definition 4.7 (Extended metric measure space). We say that $(X,\tau,d,m)$ is an extended metric
measure space if:

(a) $(X,\tau,d)$ is an extended metric-topological space;

(b) $m \in \mathcal{P}(X)$, i.e. $m$ is a Radon probability measure in $(X,\mathcal{B}(\tau))$.


5. The Wasserstein space over an extended metric-topological space 

Throughout this section $(X,\tau,d)$ is an extended metric-topological space.

5.1. The extended Wasserstein distance between Radon measures. In the class $\mathcal{P}(X)$
of Radon probability measures in $X$, we define the (quadratic) Wasserstein extended distance
$W_d(\mu,\nu)$ by

$$W_d^2(\mu,\nu) := \inf\Big\{ \int_{X\times X} d^2\, d\pi : \pi \in \Gamma(\mu,\nu) \Big\} \tag{5.1}$$

where we recall that $\Gamma(\mu,\nu)$ is the class of admissible transport plans between $\mu$ and $\nu$, see (2.8).

In our context, since $(X,\tau)$ is completely regular thanks to Lemma 4.4, we know that narrow
and weak topology coincide. From now on, unless otherwise stated, by weak convergence we
always mean convergence in the duality with $C_b$; the corresponding topology will be denoted by
$\tau_{\mathcal{P}}$.

The lower semicontinuity of the cost $d^2$ and the tightness of the marginals ensure respectively
lower semicontinuity of the transportation cost and compactness w.r.t. weak convergence of the
class of the admissible transport plans $\Gamma(\mu,\nu)$, hence existence of optimal plans. We provide a
more general statement in the next theorem.


Theorem 5.1 (Compactness and joint lower semicontinuity). Let $I$ be a directed set and assume
that $\mu_i, \nu_i \in \mathcal{P}(X)$ weakly converge to $\mu, \nu \in \mathcal{P}(X)$ respectively. Then, for any choice of
$\pi_i \in \Gamma(\mu_i,\nu_i)$, one has:

(a) the net $(\pi_i)_{i\in I}$ has limit points w.r.t. weak convergence and any limit point $\pi$ belongs to
$\Gamma(\mu,\nu)$;

(b) if $\pi_i$ weakly converges to $\pi$, and $c_i : X \times X \to [0,\infty]$ is a monotone family of $(\tau\times\tau)$-lower
semicontinuous functions, then

$$\liminf_{i\in I} \int c_i\, d\pi_i \ge \int c\, d\pi \qquad\text{with } c := \sup_i c_i ;$$

(c) if $\mu_i = \mu$, $\nu_i = \nu$, and if $d_i : X \times X \to [0,\infty]$ is a monotone family of $(\tau\times\tau)$-continuous
distances with $\lim_i d_i = d$, then $W_{d_i}$ monotonically converges to $W_d$.

Proof. Statement (a) is a direct consequence of Theorem 2.1, since $(X \times X, \tau\times\tau)$ is completely
regular. Statement (b) follows by (2.3). Statement (c) follows by (b) and (a) with $\mu_i = \mu$,
$\nu_i = \nu$, choosing optimal plans $\pi_i$ relative to $d_i^2$ and extracting a weakly convergent subnet from
the $\pi_i$. □


We claim that the duality formula

$$\frac12 W_d^2(\mu,\nu) = \sup_{\varphi\in\mathcal{F}} \int Q_1\varphi\, d\nu - \int \varphi\, d\mu \tag{5.2}$$

holds, where $\mathcal{F}$ is defined in (2.11) and $Q_t\varphi$ is defined by the Hopf-Lax formula

$$Q_t\varphi(y) := \inf_{x\in X} \varphi(x) + \frac{1}{2t}\, d^2(x,y) . \tag{5.3}$$

In fact, from the very definition of $\mathcal{F}$ in (2.11) it follows that

$$Q_t\varphi(y) = C \wedge \min_{x\in K} \Big( \varphi(x) + \frac{1}{2t}\, d^2(x,y) \Big) \tag{5.4}$$

for some compact set $K \subset X$ with $\varphi \in C(K)$ and $C \ge \max_K \varphi$, hence $Q_t\varphi$ are $\tau$-lower
semicontinuous and we can replace $\int^* Q_1\varphi\, d\nu$ with $\int Q_1\varphi\, d\nu$. In addition, the compactness of $K$
ensures that

$$\min_{x\in K} \Big( \varphi(x) + \frac12\, d_i^2(x,y) \Big) \uparrow \min_{x\in K} \Big( \varphi(x) + \frac12\, d^2(x,y) \Big)$$

hence from Proposition 2.3 (which deals with bounded cost functions) with $c = \frac12 d_i^2$ and statement
(c) of the previous theorem we obtain (5.2).
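
The easy inequality in (5.2) (the dual value never exceeds $\frac12 W_d^2$) can be illustrated on finitely supported measures. The sketch below makes several simplifying assumptions that are not in the text: $\mu$ and $\nu$ are uniform measures with the same number of atoms, so that an optimal plan can be sought among permutations; the compact set $K$ in (5.4) is the support of $\mu$; and the truncation constant $C$ is taken to be $+\infty$ by default. The function names are hypothetical.

```python
import itertools
import math

def hopf_lax(phi, K, d, t, y, C=math.inf):
    """Q_t phi(y) = min(C, min_{x in K} phi(x) + d(x, y)^2 / (2 t)), cf. (5.4)."""
    return min(C, min(phi[x] + d(x, y) ** 2 / (2.0 * t) for x in K))

def w2_squared_uniform(xs, ys, d):
    """W_d^2 between the uniform measures on xs and ys (same length), by brute force.

    For uniform marginals with equally many atoms an optimal plan can be taken
    to be a permutation, so enumerating permutations suffices (small inputs only).
    """
    n = len(xs)
    return min(
        sum(d(x, ys[j]) ** 2 for x, j in zip(xs, perm)) / n
        for perm in itertools.permutations(range(n))
    )

def dual_value(phi, xs, ys, d):
    """int Q_1 phi d(nu) - int phi d(mu) for mu uniform on xs and nu uniform on ys."""
    n = len(xs)
    return sum(hopf_lax(phi, xs, d, 1.0, y) for y in ys) / n - sum(phi[x] for x in xs) / n

def dist(x, y):
    """Euclidean distance on the real line."""
    return abs(x - y)

xs, ys = [0.0, 1.0], [2.0, 3.0]
half_w2 = 0.5 * w2_squared_uniform(xs, ys, dist)       # optimal plan: translation by 2
optimal = dual_value({0.0: 0.0, 1.0: 2.0}, xs, ys, dist)  # phi(x) = 2 x on the support of mu
```

In this example the plan is the translation $x \mapsto x + 2$, so $\frac12 W_d^2 = 2$, and the potential $\varphi(x) = 2x$ attains this value in the dual problem, while any other choice of $\varphi$ on $K$ gives a value not larger than $2$.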

We will occasionally use also the extended Wasserstein distance $W_{d,1}$ in $\mathcal{P}(X)$ obtained by
minimizing $\int d\, d\pi$ in the class of admissible transport plans, and the corresponding duality
formula

$$W_{d,1}(\mu,\nu) = \sup\Big\{ \int f\, d(\mu-\nu) : f \in \mathrm{Lip}_b(X,\tau,d),\ \mathrm{Lip}(f) \le 1 \Big\} . \tag{5.5}$$

Also the proof of (5.5) can be obtained from Proposition 2.3 with $c = d_i$, considering the pairs
$(-\varphi_i,\varphi_i)$ with

$$\varphi_i(y) := C \wedge \min_{x\in K} \big( \varphi(x) + d_i(x,y) \big) \in \mathrm{Lip}_b(X,\tau,d)$$

and $\varphi \in \mathcal{F}$, $K \subset X$ compact set as in (2.11), $C \ge \max_K \varphi$.

Proposition 5.2 ($W_d$ convergence implies $\tau_{\mathcal{P}}$ convergence). A net $\{\mu_j\}_{j\in J}$ weakly converges
to $\mu$ in $\mathcal{P}(X)$ if and only if

$$\lim_{j\in J} \int f\, d\mu_j = \int f\, d\mu \qquad\text{for every } f \in \mathrm{Lip}_b(X,\tau,d) . \tag{5.6}$$

In particular, if $\{\mu_j\}_{j\in J}$ converges to $\mu$ w.r.t. $W_d$, then it also converges w.r.t. the weak topology
induced by $C_b(X)$.

Proof. Let us first prove that (5.6) is sufficient to prove the weak convergence of $\mu_j$ in $\mathcal{P}(X)$
(the converse implication is trivial). If $f \in C_b(X)$ we consider the directed set $L^-(f)$ defined in
(4.8), obtaining

$$\liminf_{j\in J} \int f\, d\mu_j \ge \lim_{j\in J} \int g\, d\mu_j = \int g\, d\mu \qquad\text{for every } g \in L^-(f)$$

so that

$$\liminf_{j\in J} \int f\, d\mu_j \ge \sup_{g\in L^-(f)} \int g\, d\mu = \lim_{g\in L^-(f)} \int g\, d\mu = \int f\, d\mu$$

since $L^-(f)$ is a directed set with respect to the natural ordering of functions and $\mu$ is a Radon
measure. Changing $f$ in $-f$ we get the opposite inequality for the lim sup, thus proving that
$\lim_{j\in J} \int f\, d\mu_j = \int f\, d\mu$.

If $\{\mu_j\}_{j\in J}$ is a net convergent w.r.t. $W_d$ to $\mu \in \mathcal{P}(X)$ the inequality (ensured by $W_{d,1} \le W_d$
and (5.5))

$$\Big| \int f\, d\mu - \int f\, d\mu_j \Big| \le \mathrm{Lip}(f)\, W_d(\mu_j,\mu) \tag{5.7}$$

yields (5.6) and therefore the weak convergence of $\mu_j$. □


The following result shows the flexibility of our axiomatization: the extended metric-topological 
structure can be lifted from the space X to the space of probabilities on X. 


Proposition 5.3 (Extended metric-topological structure on $\mathcal{P}(X)$). $(\mathcal{P}(X),\tau_{\mathcal{P}},W_d)$ is an
extended metric-topological space.

Proof. Let us first show that condition (a) of Definition 4.1 is satisfied.

We consider the set $F = \mathrm{Lip}_{b,1}(X,\tau,d)$ and we denote by $I$ the collection (directed set) of the
finite subsets of $F$ ordered by inclusion. For every $i \in I$ we can set

$$M_i := \sup_{x\in X} \sup_{f\in i} |f(x)|, \qquad d_i(x,y) := \sup_{f\in i} |f(x) - f(y)| .$$

Since we already noticed that $W_{d_i} \uparrow W_d$, it remains to prove that each semidistance $W_{d_i}$ is
$(\tau_{\mathcal{P}}\times\tau_{\mathcal{P}})$-continuous. So, for a fixed $i = \{f_1,\ldots,f_N\} \subset F$ we consider the $\tau$-continuous map
$\kappa : X \to X_i = [-M_i,M_i]^N$ and the distance $\delta$ in $X_i$ defined by

$$\kappa(x) := (f_1(x),\ldots,f_N(x)), \qquad \delta(x,y) := \sup_{n=1,\ldots,N} |x_n - y_n|, \quad x, y \in \mathbb{R}^N,$$

so that $\kappa$ is 1-Lipschitz. Since in $\mathcal{P}(X_i)$ the weak topology coincides with the topology induced
by the Wasserstein distance $W_\delta$, $\kappa_\sharp$ is continuous from $(\mathcal{P}(X),\tau_{\mathcal{P}})$ to $(\mathcal{P}(X_i),W_\delta)$; it is therefore
sufficient to prove that

$$W_{d_i}(\mu,\nu) \le W_\delta(\kappa_\sharp\mu,\kappa_\sharp\nu) \qquad\text{for every } \mu, \nu \in \mathcal{P}(X) \tag{5.8}$$

which in fact yields the equality, since the opposite inequality is trivial. To prove (5.8) we apply
Proposition 2.3 with $c = d_i^2$ to find a sequence of uniformly bounded Borel functions $\varphi_n$, $\psi_n$
such that

$$\psi_n(y) - \varphi_n(x) \le d_i^2(x,y) \ \text{ for every } x, y \in X, \qquad
W_{d_i}^2(\mu,\nu) = \lim_{n\to\infty} \int \psi_n\, d\nu - \int \varphi_n\, d\mu .$$

Possibly replacing $\psi_n$ and $\varphi_n$ by suitable $c$-transforms and using the fact that $c$-concave functions are
$d_i$-Lipschitz (and therefore $\tau$-continuous) we may assume that $\varphi_n, \psi_n \in \mathrm{Lip}_b(X,\tau,d_i)$. We can then
define functions $\tilde\varphi_n, \tilde\psi_n : \kappa(X) \to \mathbb{R}$ by

$$\tilde\varphi_n(\kappa(x)) := \varphi_n(x), \qquad \tilde\psi_n(\kappa(x)) := \psi_n(x), \qquad x \in X,$$

the definition being consistent since $\kappa(x) = \kappa(y)$ yields $d_i(x,y) = 0$ and therefore $\varphi_n(x) = \varphi_n(y)$
and $\psi_n(x) = \psi_n(y)$. Since moreover $\delta(\kappa(x),\kappa(y)) = d_i(x,y)$, we can easily prove that $\tilde\varphi_n$, $\tilde\psi_n$ are
Lipschitz continuous w.r.t. $\delta$ in $\kappa(X)$, and therefore they admit a unique Lipschitz continuous
extension (still denoted by $\tilde\varphi_n$, $\tilde\psi_n$) to the compact set $\overline{\kappa(X)}$. Notice that both the supports
$\mathrm{supp}(\kappa_\sharp\mu)$ and $\mathrm{supp}(\kappa_\sharp\nu)$ in $X_i$ are contained in $\overline{\kappa(X)}$. Moreover the relation

$$\tilde\psi_n(y) - \tilde\varphi_n(x) \le \delta^2(x,y) \qquad\text{for every } x, y \in \kappa(X)$$

extends by continuity to $\overline{\kappa(X)}$, so that

$$W_\delta^2(\kappa_\sharp\mu,\kappa_\sharp\nu) \ge \int \tilde\psi_n\, d\kappa_\sharp\nu - \int \tilde\varphi_n\, d\kappa_\sharp\mu = \int \psi_n\, d\nu - \int \varphi_n\, d\mu ,$$

proving (5.8) in the limit $n \to \infty$.

In order to prove that also condition (b) of Definition 4.1 is satisfied, we just observe that the
family $\mathcal{T}$ of real functions on $\mathcal{P}(X)$ of the form

$$\mu \mapsto \int f\, d\mu, \qquad f \in \mathrm{Lip}_{b,1}(X,\tau,d)$$

is included in $\mathrm{Lip}_{b,1}(\mathcal{P}(X),\tau_{\mathcal{P}},W_d)$ thanks to the very definition of weak convergence and to
(5.7). On the other hand, Proposition 5.2 shows that $\tau_{\mathcal{P}}$ is induced by $\mathcal{T}$. □


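Proposition 5.4 (Completeness of $(\mathcal{P}(X),W_d)$). If $(X,\tau,d)$ is an extended metric-topological
space and $(X,d)$ is complete, then also $(\mathcal{P}(X),W_d)$ is complete.

Proof. Let $(\mu_n)_{n\in\mathbb{N}} \subset \mathcal{P}(X)$ be a sequence satisfying $\sum_{n=1}^\infty W_d(\mu_n,\mu_{n+1}) < \infty$. We argue as
in the proof of [2, Prop. 7.1.5]: if $\pi_n \in \Gamma(\mu_n,\mu_{n+1})$ are optimal transport plans, by applying
Lemma 2.2 we can find a Radon measure $\boldsymbol\pi \in \mathcal{P}(\boldsymbol X)$, $\boldsymbol X = X^{\mathbb{N}}$, such that $(p^n,p^{n+1})_\sharp \boldsymbol\pi = \pi_n$,
$n \in \mathbb{N}$.

We thus have $\sum_{n=1}^\infty \int d(p^n,p^{n+1})\, d\boldsymbol\pi < \infty$ so that the sequence $n \mapsto p^n(\boldsymbol x)$ is a Cauchy
sequence for $\boldsymbol\pi$-a.e. $\boldsymbol x \in \boldsymbol X$. Denoting by $p(\boldsymbol x)$ its pointwise limit, and applying Egoroff Theorem
[39, Thm. 6, p. 28] we can find for every $\varepsilon > 0$ compact sets $K_\varepsilon \subset \boldsymbol X$ with $\boldsymbol\pi(\boldsymbol X \setminus K_\varepsilon) \le \varepsilon$ such
that the restrictions of $p^n$ to $K_\varepsilon$ converge uniformly, i.e.

$$\lim_{n\to\infty} \sup_{\boldsymbol x\in K_\varepsilon} d(p^n(\boldsymbol x),p(\boldsymbol x)) = 0 . \tag{5.9}$$

Let us prove that $p$ is a Lusin $\boldsymbol\pi$-measurable map by showing that the restriction of $p$ to each
$K_\varepsilon$ is continuous: the latter property will be a consequence of the fact that the $d$-uniform limit of
$\tau$-continuous maps is $\tau$-continuous. In fact, for every $\boldsymbol x \in K_\varepsilon$ and every $\tau$-neighbourhood $V$ of
$p(\boldsymbol x)$ in $X$, we may find functions $f_j \in \mathrm{Lip}_{b,1}(X,\tau,d)$, $j = 1,\ldots,J$, and $\delta > 0$ such that

$$|f_j(p(\boldsymbol x)) - f_j(y)| < \delta \ \text{ for every } j = 1,\ldots,J \quad\Longrightarrow\quad y \in V .$$

If $n \in \mathbb{N}$ is sufficiently big so that $\sup_{\boldsymbol z\in K_\varepsilon} d(p^n(\boldsymbol z),p(\boldsymbol z)) \le \delta/3$ and $U$ is any neighbourhood of $\boldsymbol x$
in $\boldsymbol X$ such that every $\boldsymbol y \in U$ satisfies $|f_j(p^n(\boldsymbol x)) - f_j(p^n(\boldsymbol y))| < \delta/3$ for every $j = 1,\ldots,J$, we
conclude that for every $\boldsymbol y \in U \cap K_\varepsilon$

$$|f_j(p(\boldsymbol x)) - f_j(p(\boldsymbol y))| \le 2 \sup_{\boldsymbol z\in K_\varepsilon} d(p(\boldsymbol z),p^n(\boldsymbol z)) + |f_j(p^n(\boldsymbol x)) - f_j(p^n(\boldsymbol y))| < \delta \qquad\text{for every } j = 1,\ldots,J ,$$

so that $p(\boldsymbol y) \in V$.

Being $p$ a Lusin $\boldsymbol\pi$-measurable map, $\mu := p_\sharp\boldsymbol\pi$ is a Radon measure in $X$ and

$$W_d(\mu_n,\mu) \le \Big( \int d^2(p^n,p)\, d\boldsymbol\pi \Big)^{1/2} \le \sum_{m=n}^\infty \Big( \int d^2(p^m,p^{m+1})\, d\boldsymbol\pi \Big)^{1/2} = \sum_{m=n}^\infty W_d(\mu_m,\mu_{m+1}) ,$$

which shows that $\lim_{n\to\infty} W_d(\mu_n,\mu) = 0$. □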


5.2. The superposition principle for extended metric-topological spaces. The next 
proposition is a small variant of the superposition principle recently proved in [34, Thm. 3.1] 
for extended metric measure spaces. We provide a slightly different proof, since no Polish 
assumption on (X, r) is made here, only the complete regularity of r following by Lemma 4.4 
plays a role. 
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Proposition 5.5 (Superposition). Assume that (X, d) is complete and let fit € AC 2 ([0, T]; (^(X), Wj)). 
Then there exists rj £ hP(X^' T }) concentrated on AC 2 {[ 0,T];(X, d)) with ( et)#rj = fit for all 
t £ [0, T] and 

J \h{t )\ 2 drf(rf) = \fi t \ 2 for a.e. t £ (0 ,T) . (5.10) 

Proof. Let us assume for simplicity T = 1. We set D = U n D n with D n = {j/2 n : 0 < j < 2 n }. 

Step 1. We build <r £ h?(X D ) concentrated on AC 2 (D; (X, d)) satisfying ( et)#cr = fit. for all 
t £ D and 

/ E l^n-^ + 0/201- , P /2 " |AT dr Vi, j e { 0,... ,2"}, z < j, n > 1 . 

J k=i 1 Ji/2 n 

(5.H) 

To this aim, taking Theorem 4.6, (4.13) and the fact that D n f D into account, it is sufficient 
to build a family of approximations cr n £ &(X D ) satisfying (et)#cr n = fit. for all t £ D n and 

• J_1 \rf(k/2 n ) — rj((k + l)/2 n )| 2 

- u u n yif) ^ 

li/2 n 

(5- 12 ) 

The construction of a n is a simple application of Lemma 2.2: it suffices to choose optimal plans 
7 r.j from fii/zn to iiu + \)/ 2 n i 0 < i < 2 n , and then use the lemma to find a Radon probability 
measure 7r in X 2 " +1 having 7T,; as double marginals. Then one can define cr n £ hP(X D ) as the 
push forward of 7r via the continuous map from X 2n+l to X D defined by 


E 

k=i 


A?/ 2 " 

dcr n (rf)< \fi r \ 2 dr Vi, j £ {0,..., 2 n }, i < j . 

Ji/2 n 


(.To, • • ■ ,x n ) Tj(t) = 


Xi if < t < ^r, 0 < i < n - 1 
x„ if t = 1 . 


Step 2. Given $\boldsymbol\sigma$ as in Step 1, we notice that $\boldsymbol\sigma$ is concentrated on the union of the closed sets $\Gamma_k = \{\eta : \mathcal E_2(\eta,D) \le k\}$. Since $(X,\mathsf d)$ is complete we can consider the extension map $\mathrm{ext} : \bigcup_k \Gamma_k \to \mathrm{AC}^2([0,1];(X,\mathsf d))$ and build $\boldsymbol\eta$ as the image of $\boldsymbol\sigma$ under $\mathrm{ext}$. To show that $\boldsymbol\eta$ is well defined and that it is a Radon measure in $\mathscr P(X^{[0,T]})$ we need to show that $\mathrm{ext}$ (arbitrarily defined out of $\bigcup_k \Gamma_k$) is Lusin $\boldsymbol\sigma$-measurable. To this purpose, it is sufficient to prove that $\mathrm{ext} : \Gamma_k \to X^{[0,T]}$ is continuous. Let $(\eta_i)_{i \in I}$ be a net in $\Gamma_k$ convergent to $\eta$ and let $\tilde\eta_i$, $\tilde\eta$ be the corresponding extensions to $[0,1]$. By the definition of product topology we need only to prove that $\tilde\eta_i(t) \to \tilde\eta(t)$ in $(X,\tau)$ for all $t \in [0,1]$. Since $\tau$ is generated by $\mathrm{Lip}_b(X,\tau,\mathsf d)$ we need only to prove that $f(\tilde\eta_i(t)) \to f(\tilde\eta(t))$ for all $f \in \mathrm{Lip}_b(X,\tau,\mathsf d)$. This is trivial if $t \in D$ (because $\eta_i \to \eta$ in $X^D$); in the general case one can use the estimate

\[ |f(\tilde\eta_i(t)) - f(\tilde\eta_i(s))| \le \mathrm{Lip}(f)\,\mathsf d(\tilde\eta_i(t),\tilde\eta_i(s)) \le \mathrm{Lip}(f)\sqrt{k|t-s|} \]

and the analogous one for $\tilde\eta$ to conclude.

Having proved that $\boldsymbol\eta$ is well defined in $\mathscr P(X^{[0,T]})$ and that it is concentrated on the Borel set $\mathrm{AC}^2([0,1];(X,\mathsf d))$, we notice that by construction one has $(\mathrm e_t)_\sharp\boldsymbol\eta = \mu_t$ for all $t \in D$. On the other hand, $\mu_t \in \mathrm{AC}^2([0,T];(\mathscr P(X),W_{\mathsf d}))$ implies, thanks to Proposition 5.2, that $t \mapsto \mu_t$ is continuous w.r.t. $\tau_{\mathscr P}$. Since $\mathsf d$-convergence implies $\tau$-convergence, also $t \mapsto (\mathrm e_t)_\sharp\boldsymbol\eta$ is continuous w.r.t. $\tau_{\mathscr P}$, therefore $(\mathrm e_t)_\sharp\boldsymbol\eta = \mu_t$ for all $t \in [0,1]$. An analogous approximation argument gives

\[ \int \mathcal E_2(\eta,[s,t]) \, d\boldsymbol\eta(\eta) \le \int_s^t |\dot\mu_r|^2 \, dr \quad \forall s,t \in [0,1],\ s<t \tag{5.13} \]

starting from (5.11).

Finally, from (5.13) and Fubini's theorem we get

\[ \int |\dot\eta(t)|^2 \, d\boldsymbol\eta(\eta) \le |\dot\mu_t|^2 \quad \text{for a.e. } t \in (0,1). \tag{5.14} \]

On the other hand, since $(\mathrm e_s,\mathrm e_t)_\sharp\boldsymbol\eta \in \Gamma(\mu_s,\mu_t)$ one has

\[ W_{\mathsf d}^2(\mu_s,\mu_t) \le \int \mathsf d^2(\eta(s),\eta(t)) \, d\boldsymbol\eta(\eta) \le (t-s) \int\!\!\int_s^t |\dot\eta|^2(r) \, dr \, d\boldsymbol\eta(\eta), \]

hence for a.e. $t \in (0,1)$ the converse inequality to (5.14) holds. □
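
The gluing step of the proof can be illustrated in a finite toy setting. The sketch below is not the construction of Lemma 2.2 itself: it assumes three probability vectors on finite ordered sets (the names `mu0`, `mu1`, `mu2`, `monotone_coupling` and `glue` are hypothetical), couples consecutive measures by the north-west corner rule, and glues the two plans through their common marginal. Exact rational arithmetic makes the marginal identities hold without rounding.

```python
from fractions import Fraction as Fr

def monotone_coupling(mu, nu):
    """North-west corner rule: a coupling of two probability vectors.

    For points listed in increasing order on the real line this is the
    monotone coupling, which is optimal for the quadratic cost.
    """
    a, b = list(mu), list(nu)
    plan = [[Fr(0)] * len(b) for _ in a]
    i = j = 0
    while i < len(a) and j < len(b):
        moved = min(a[i], b[j])
        plan[i][j] += moved
        a[i] -= moved
        b[j] -= moved
        if a[i] == 0:
            i += 1
        if b[j] == 0:
            j += 1
    return plan

def glue(p01, p12, mu1):
    """Glue two plans sharing the middle marginal mu1 (Markov composition)."""
    return [[[p01[i][j] * p12[j][k] / mu1[j] if mu1[j] else Fr(0)
              for k in range(len(p12[j]))]
             for j in range(len(mu1))]
            for i in range(len(p01))]

# Hypothetical data: three measures at three consecutive dyadic times.
mu0 = [Fr(1, 2), Fr(1, 2)]
mu1 = [Fr(1, 4), Fr(1, 2), Fr(1, 4)]
mu2 = [Fr(1, 3), Fr(2, 3)]

p01 = monotone_coupling(mu0, mu1)
p12 = monotone_coupling(mu1, mu2)
pi = glue(p01, p12, mu1)

# Double marginals of the glued measure on consecutive coordinates.
marg01 = [[sum(pi[i][j]) for j in range(3)] for i in range(2)]
marg12 = [[sum(pi[i][j][k] for i in range(2)) for k in range(2)]
          for j in range(3)]
```

In this toy model `pi` plays the role of the measure on $X^{2^n+1}$ with prescribed double marginals; pushing it forward through the piecewise constant interpolation map would give the analogue of $\boldsymbol\sigma_n$.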

6. Cheeger energy and minimal relaxed slope 

Throughout this section $(X,\tau,\mathsf d,\mathfrak m)$ is an extended metric measure space according to Definition 4.7. In this section we provide basic calculus results already developed in [3], with minor variants in the definitions that do not really affect the proofs.

For $f \in \mathrm{Lip}_b(X,\tau,\mathsf d)$, the asymptotic Lipschitz constant $\mathrm{Lip}_a(f,\cdot) : X \to [0,\infty]$ is defined by

\[ \mathrm{Lip}_a(f,x) := \lim_{r \downarrow 0} \mathrm{Lip}_a(f,x,r) \quad \text{with} \quad \mathrm{Lip}_a(f,x,r) := \sup_{\mathsf d(y,x) \vee \mathsf d(z,x) < r,\ \mathsf d(y,z) > 0} \frac{|f(y)-f(z)|}{\mathsf d(y,z)}, \tag{6.1} \]

and with the usual convention $\mathrm{Lip}_a(f,x) = 0$ at $\mathsf d$-isolated points $x$. By construction the function $\mathrm{Lip}_a(f,\cdot)$ is $\mathsf d$-upper semicontinuous. In the standard case when $\mathsf d$ is a finite distance and $\tau$ is the metric topology it follows that $\mathrm{Lip}_a(f,\cdot)$ is also $\tau$-upper semicontinuous.
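
A small numerical sketch may help to read (6.1). It assumes the simplest possible setting, the real line with $\mathsf d(y,z) = |y-z|$, and replaces the supremum over the open ball by a maximum over a finite grid, so it only produces lower bounds of $\mathrm{Lip}_a(f,x,r)$; the function names are hypothetical.

```python
def lip_a(f, x, r, n=200):
    """Grid approximation of Lip_a(f, x, r) on the real line, d(y, z) = |y - z|.

    The supremum over pairs in the open ball B_r(x) is replaced by a maximum
    over n grid points strictly inside the ball, so the result is a lower bound.
    """
    pts = [x - r + 2 * r * (k + 1) / (n + 1) for k in range(n)]
    return max(abs(f(y) - f(z)) / abs(y - z)
               for a, y in enumerate(pts) for z in pts[a + 1:])

def lip_a_along_radii(f, x, radii=(1e-1, 1e-2, 1e-3)):
    """Values of r -> Lip_a(f, x, r) along a decreasing list of radii."""
    return [lip_a(f, x, r) for r in radii]

square = lambda y: y * y
values_square = lip_a_along_radii(square, 1.0)   # decreases towards |f'(1)| = 2
value_abs = lip_a(abs, 0.0, 1e-3)                # the kink at 0 still gives 1
```

For $f(y) = y^2$ the difference quotient equals $|y+z|$, so the values decrease to $2$ as $r \downarrow 0$. For $f(y) = |y|$ at $x = 0$ the asymptotic Lipschitz constant equals $1$, because pairs on the same side of the origin are always available in the ball.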

Definition 6.1 (Cheeger energy). For all $f \in L^2(X,\mathfrak m)$ we set

\[ \mathrm{Ch}(f) := \inf \liminf_{n \to \infty} \int g_n^2 \, d\mathfrak m, \qquad D(\mathrm{Ch}) := \{ f \in L^2(X,\mathfrak m) : \mathrm{Ch}(f) < \infty \}, \]

where the infimum runs among all sequences $(f_n) \subset \mathrm{Lip}_b(X,\tau,\mathsf d)$ with $\lim_n \int |f_n - f|^2 \, d\mathfrak m = 0$ and all $\mathfrak m$-measurable functions $g_n \ge \mathrm{Lip}_a(f_n,\cdot)$ $\mathfrak m$-a.e. in $X$.

Motivated by the previous definition we may define, for $f \in \mathrm{Lip}(X,\tau,\mathsf d)$, $\mathrm{Lip}_a^*(f,\cdot)$ as the least (in the $\mathfrak m$-a.e. sense) $\mathfrak m$-measurable function larger $\mathfrak m$-a.e. than $\mathrm{Lip}_a(f,\cdot)$. Then $\mathrm{Ch}$ can be equivalently defined by minimizing $\liminf_n \int (\mathrm{Lip}_a^*(f_n,\cdot))^2 \, d\mathfrak m$ among all sequences $(f_n) \subset \mathrm{Lip}_b(X,\tau,\mathsf d)$ with $\lim_n \int |f_n - f|^2 \, d\mathfrak m = 0$.

The concept of minimal relaxed slope is closely related to the definition of $\mathrm{Ch}$. First, one calls relaxed slope of $f$ any function $G \ge g$, with $g$ a weak $L^2(X,\mathfrak m)$ limit point as $n \to \infty$ of $\mathrm{Lip}_a^*(f_n,\cdot)$, where $f_n \in \mathrm{Lip}_b(X,\tau,\mathsf d)$ and $f_n \to f$ in $L^2(X,\mathfrak m)$. It can be proved ([3, Lem. 4.3]) that the class of relaxed slopes is a convex closed subset of $L^2(X,\mathfrak m)$, not empty if and only if $f \in D(\mathrm{Ch})$. The minimal relaxed slope, denoted $|\mathrm D f|_w$ (and occasionally $|\mathrm D f|_{w,\mathsf d}$ to emphasize its dependence on $\mathsf d$), is the relaxed slope with smallest $L^2(X,\mathfrak m)$ norm.

In analogy with the classical case, for all $f \in L^2(X,\mathfrak m)$ with $\partial\mathrm{Ch}(f) \ne \emptyset$ we denote by $\Delta f$ the element with minimal $L^2(X,\mathfrak m)$ norm in $-\tfrac12\partial\mathrm{Ch}(f)$.

We now recall some basic calculus rules and more precise relations between Ch and the 
minimal relaxed slope. Properties (g) and (h) below involve the notion of test plan, recalled 
below. 

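Definition 6.2 (Test plan). We say that $\boldsymbol\eta \in \mathscr P(X^{[0,1]})$ is a 2-test plan (relative to $\mathfrak m$) if $\boldsymbol\eta$ is concentrated on $\mathrm{AC}^2([0,1];(X,\mathsf d))$ and there exists $C \in [0,\infty)$ satisfying $(\mathrm e_t)_\sharp\boldsymbol\eta \le C\mathfrak m$ for all $t \in [0,1]$. The least constant $C$ with this property will be denoted by $C(\boldsymbol\eta)$.

Proposition 6.3. The following properties hold:

(a) For all $f, g \in D(\mathrm{Ch})$, $\alpha, \beta \in \mathbb R$

\[ |\mathrm D(\alpha f + \beta g)|_w \le |\alpha|\,|\mathrm D f|_w + |\beta|\,|\mathrm D g|_w; \tag{6.2} \]

in particular $\mathrm{Ch}$ and $(\mathrm{Ch})^{1/2}$ are convex and lower semicontinuous functionals in $L^2(X,\mathfrak m)$, with a dense domain.

(b) For all $f \in D(\mathrm{Ch})$ one has $\mathrm{Ch}(f) = \int |\mathrm D f|_w^2 \, d\mathfrak m$ and there exist $f_n \in \mathrm{Lip}_b(X,\tau,\mathsf d)$ with $f_n \to f$ in $L^2(X,\mathfrak m)$ and $\mathrm{Lip}_a^*(f_n,\cdot) \to |\mathrm D f|_w$ in $L^2(X,\mathfrak m)$.

(c) $|\mathrm D f|_w = |\mathrm D g|_w$ $\mathfrak m$-a.e. in $\{f = g\}$ for all $f, g \in D(\mathrm{Ch})$.

(d) $|\mathrm D f|_w \le \mathrm{Lip}_a^*(f,\cdot)$ $\mathfrak m$-a.e. in $X$ for all $f \in \mathrm{Lip}_b(X,\tau,\mathsf d)$.

(e) $|\mathrm D \phi(f)|_w = |\phi'(f)|\,|\mathrm D f|_w$ $\mathfrak m$-a.e. in $X$, for all $f \in D(\mathrm{Ch})$ and $\phi : \mathbb R \to \mathbb R$ Lipschitz.

(f) $-\int f \Delta g \, d\mathfrak m \le \int |\mathrm D f|_w |\mathrm D g|_w \, d\mathfrak m$ for all $f \in D(\mathrm{Ch})$, $g \in D(\Delta)$.

(g) If $\boldsymbol\eta \in \mathscr P(X^{[0,1]})$ is a test plan, then for all $f \in D(\mathrm{Ch})$ one has

\[ |f(\eta(1)) - f(\eta(0))| \le \int_0^1 |\mathrm D f|_w(\eta(s))\,|\dot\eta(s)| \, ds \quad \text{for } \boldsymbol\eta\text{-a.e. } \eta. \]

(h) If $\boldsymbol\eta \in \mathscr P(X^{[0,1]})$ is a test plan, then for all $f \in D(\mathrm{Ch})$ one has

\[ \limsup_{t \downarrow 0} \int \frac{|f(\eta(t)) - f(\eta(0))|^2}{(E_t(\eta))^2} \, d\boldsymbol\eta(\eta) \le \int |\mathrm D f|_w^2(\eta(0)) \, d\boldsymbol\eta(\eta), \]

where $E_t(\eta) := \sqrt{t \int_0^t |\dot\eta(s)|^2 \, ds}$.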

Proof. The properties from (a) to (e) are proved in Lemma 4.3, Lemma 4.4, Theorem 4.5, and Proposition 4.8 of [3]. The proof of (f) relies on the convexity inequality $|\mathrm D(f + \varepsilon g)|_w \le |\mathrm D f|_w + \varepsilon|\mathrm D g|_w$ with $\varepsilon > 0$, see Proposition 4.15 of [3]. Property (g) is proved in Theorem 5.14, Corollary 5.15 (see also Corollary 3.15) of [3], using Mazur's lemma, property (b) and the upper gradient property of the asymptotic Lipschitz constant. Finally, (h) follows by (g) and Hölder's inequality, which give

\[ \int \frac{|f(\eta(t)) - f(\eta(0))|^2}{(E_t(\eta))^2} \, d\boldsymbol\eta(\eta) \le \frac1t \int_0^t \int |\mathrm D f|_w^2 \, d(\mathrm e_s)_\sharp\boldsymbol\eta \, ds. \]

□


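Corollary 6.4. If $(X,\mathsf d)$ is complete, for all $g \in D(\mathrm{Ch})$ and all $\mu_t = \rho_t\mathfrak m \in \mathrm{AC}^2([0,T];(\mathscr P(X),W_{\mathsf d}))$ with $\sup_t \|\rho_t\|_{L^\infty(X,\mathfrak m)} < \infty$ one has

\[ \int g\rho_T \, d\mathfrak m - \int g\rho_0 \, d\mathfrak m \le \int_0^T \Big( \int |\mathrm D g|_w^2 \rho_t \, d\mathfrak m \Big)^{1/2} |\dot\mu_t| \, dt. \]

Proof. By Proposition 5.5 we can find $\boldsymbol\eta \in \mathscr P(X^{[0,T]})$ concentrated on $\mathrm{AC}^2([0,T];(X,\mathsf d))$ with $(\mathrm e_t)_\sharp\boldsymbol\eta = \rho_t\mathfrak m$ for all $t \in [0,T]$ and satisfying (5.10), so that Proposition 6.3(g) gives

\[ g(\eta(T)) \le g(\eta(0)) + \int_0^T |\mathrm D g|_w(\eta(t))\,|\dot\eta(t)| \, dt \quad \text{for } \boldsymbol\eta\text{-a.e. } \eta. \]

By integrating this inequality and using Hölder's inequality with $t$ fixed together with (5.10) the proof is achieved. □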


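In the sequel we denote by

\[ \mathsf F(\rho) := 4\,\mathrm{Ch}(\sqrt\rho), \qquad \rho \in L^1_+(X,\mathfrak m) \]

the so-called Fisher information functional. Let us recall its main properties (see [3] for the simple proof).

Proposition 6.5. $\mathsf F$ is a convex and $L^1$-lower semicontinuous functional in $L^1_+(X,\mathfrak m)$. If $\sqrt\rho \in D(\mathrm{Ch})$ we have the equivalent representation

\[ \mathsf F(\rho) = \int_{\{\rho > 0\}} \frac{|\mathrm D\rho|_w^2}{\rho} \, d\mathfrak m. \]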

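We denote by $\mathsf P_t$ the $L^2(X,\mathfrak m)$ (metric) gradient flow of the convex and lower semicontinuous functional $\tfrac12\mathrm{Ch}$. Since $D(\mathrm{Ch})$ includes $\mathrm{Lip}_b(X,\tau,\mathsf d)$, which is dense in $L^2(X,\mathfrak m)$, $\mathsf P_t$ is a contraction semigroup in $L^2(X,\mathfrak m)$, characterized by

\[ \frac{d}{dt}\mathsf P_t f = \Delta\mathsf P_t f \quad \text{for a.e. } t > 0. \tag{6.3} \]

Besides the general properties of gradient flows in Hilbert spaces listed in Proposition 3.4, we recall that $\mathsf P_t$ satisfies the maximum and minimum principle (i.e. if $c \le f \le C$ $\mathfrak m$-a.e. in $X$, then $c \le \mathsf P_t f \le C$ $\mathfrak m$-a.e. in $X$ for all $t \ge 0$) and that

\[ \Big| \frac{d}{dt} \int f\,\mathsf P_t\rho \, d\mathfrak m \Big| \le \mathsf F^{1/2}(\mathsf P_t\rho) \Big( \int |\mathrm D f|_w^2\,\mathsf P_t\rho \, d\mathfrak m \Big)^{1/2} \quad \text{for a.e. } t \in (0,\infty). \tag{6.4} \]

The maximum and minimum principle can be derived from Proposition 6.3(e), while (6.4) is a direct consequence of Proposition 6.3(f) and of (6.3).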
Note also the following fact (whose proof can be obtained by a simple regularization argument, since $\mathfrak m$ is finite, see [3]): for all $f \in L^2_+(X,\mathfrak m)$, setting $f_t := \mathsf P_t f$, the function $t \mapsto \int f_t \log f_t \, d\mathfrak m$ is absolutely continuous in $[0,\infty)$ and

\[ \frac{d}{dt} \int f_t \log f_t \, d\mathfrak m = -\int_{\{f_t > 0\}} \frac{|\mathrm D f_t|_w^2}{f_t} \, d\mathfrak m \quad \text{for a.e. } t > 0. \tag{6.5} \]

In particular, the right hand side is locally integrable in $[0,\infty)$.
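
The qualitative properties recalled above (mass conservation, maximum and minimum principle, decay of the entropy) can be observed in a finite-dimensional toy model. The sketch below is only an analogue and not the semigroup $\mathsf P_t$ of the text: it assumes a cycle graph with uniform reference measure, the graph Laplacian in place of $\Delta$, and an explicit Euler scheme with a step `dt` no larger than `0.5`, for which each step is a convex averaging of neighbouring values. All names are hypothetical.

```python
import math

def heat_step(f, dt):
    """One explicit Euler step of d/dt f = (graph Laplacian) f on a cycle."""
    n = len(f)
    return [f[i] + dt * (f[(i + 1) % n] + f[i - 1] - 2 * f[i])
            for i in range(n)]

def entropy(f):
    """Discrete analogue of the integral of f log f w.r.t. the uniform measure."""
    return sum(v * math.log(v) for v in f if v > 0) / len(f)

def run(f0, dt=0.25, steps=200):
    """Return the list of iterates [f_0, f_1, ..., f_steps]."""
    traj = [list(f0)]
    for _ in range(steps):
        traj.append(heat_step(traj[-1], dt))
    return traj

# Hypothetical initial density w.r.t. the uniform measure (its mean is 1).
f0 = [0.2, 3.0, 0.5, 0.3, 1.0, 1.0, 1.5, 0.5]
traj = run(f0)
masses = [sum(f) / len(f) for f in traj]
entropies = [entropy(f) for f in traj]
```

With `dt <= 0.5` every new value is a convex combination of old values, which yields the discrete maximum principle, and the averaging matrix is doubly stochastic, so the entropy cannot increase along the iteration.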

7. Extended distances in $\mathscr P^a(X)$

In this section we introduce a class of absolutely continuous curves in an extended metric measure space $(X,\tau,\mathsf d,\mathfrak m)$, following the analogy with [2, Thm. 8.3.1], [9], [27].

We first introduce a Banach structure on two Sobolev classes of test functions, $D(\mathrm{Ch})$ and the algebra

\[ \mathcal A_{\mathrm{Ch}} := \{ f \in D(\mathrm{Ch}) : f,\ |\mathrm D f|_w \in L^\infty(X,\mathfrak m) \} \tag{7.1} \]

which obviously includes $\mathrm{Lip}_b(X,\tau,\mathsf d)$ and is dense in $L^2(X,\mathfrak m)$; a simple truncation argument also shows that $\mathcal A_{\mathrm{Ch}}$ is dense in $L^p(X,\mathfrak m)$ for every $p \in [1,\infty)$.

Since $f \mapsto \sqrt{\mathrm{Ch}(f)}$ is convex in $D(\mathrm{Ch})$ and the function $(x,y) \mapsto \sqrt{x^2+y^2}$ is a norm in $\mathbb R^2$, it is easy to check that

\[ \|f\|_{D(\mathrm{Ch})} := \big( \|f\|_2^2 + \mathrm{Ch}(f) \big)^{1/2} = \Big( \int (|f|^2 + |\mathrm D f|_w^2) \, d\mathfrak m \Big)^{1/2} \tag{7.2} \]

is a norm in $D(\mathrm{Ch})$; the lower semicontinuity of $\mathrm{Ch}$ with respect to $L^2$-convergence also shows that $(D(\mathrm{Ch}), \|\cdot\|_{D(\mathrm{Ch})})$ is a Banach space. Similarly, $\mathcal A_{\mathrm{Ch}}$ is a Banach algebra w.r.t. the norm

\[ \|f\|_{\mathcal A_{\mathrm{Ch}}} := \|f\|_\infty + \big\| |\mathrm D f|_w \big\|_\infty. \tag{7.3} \]

$D(\mathrm{Ch})$ and the algebra $\mathcal A_{\mathrm{Ch}}$ are not separable in general, but since their norms are lower semicontinuous w.r.t. the $L^2$ convergence, they are $F_\sigma$ and thus Borel subsets of $L^2(X,\mathfrak m)$. When we consider measurability of maps $\phi$ with values in $D(\mathrm{Ch})$ or $\mathcal A_{\mathrm{Ch}}$, we will always refer to the Borel $\sigma$-algebra inherited from the $L^2$ topology.

7.1. The dynamic approach and the continuity inequality. 

Definition 7.1 (Continuity inequality). Given a family of probability densities $\rho_t$, $t \in [0,T]$, we write $\rho_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ if there exists $c \in L^2(0,T)$ satisfying

\[ \Big| \int f\rho_t \, d\mathfrak m - \int f\rho_s \, d\mathfrak m \Big| \le \int_s^t c(r) \Big( \int |\mathrm D f|_w^2 \rho_r \, d\mathfrak m \Big)^{1/2} dr \tag{7.4} \]

for all $f \in \mathcal A_{\mathrm{Ch}}$ and all $0 \le s \le t \le T$. The least $c$ in (7.4) is denoted $\|\rho_t'\|$.

For simplicity of notation we do not emphasize the $T$ dependence in the previous definition. We will often deal with the case when $\rho_t$ are essentially bounded, uniformly w.r.t. time: in this case, by using the density of $\mathcal A_{\mathrm{Ch}}$ in $L^1(X,\mathfrak m)$, it is then easy to check that $t \mapsto \rho_t$ is weakly* continuous with values in $L^\infty(X,\mathfrak m)$ (see also the next Theorem 7.3); we will write $\rho \in C_{w*}([0,T];L^\infty(X,\mathfrak m))$.

Remark 7.2. It is a direct consequence of (6.4) and (6.5) that $\rho_t := \mathsf P_t\rho$ belongs to $\mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ for all $\rho \in L^2_+(X,\mathfrak m)$, with

\[ \|\rho_t'\|^2 \le \int_{\{\rho_t > 0\}} \frac{|\mathrm D\rho_t|_w^2}{\rho_t} \, d\mathfrak m \quad \text{for a.e. } t \in (0,\infty). \]

In the following theorem we show a "differential" characterization of absolutely continuous curves in $(\mathscr P(X),W_{\mathsf d})$, which provides a key link between the metric and the differentiable viewpoints.
Theorem 7.3 (Differential characterization of absolutely continuous curves). For all $\rho_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ one has $\mu_t = \rho_t\mathfrak m \in \mathrm{AC}^2([0,T];(\mathscr P(X),W_{\mathsf d}))$ and

\[ |\dot\mu_t| \le \|\rho_t'\| \quad \text{for a.e. } t \in (0,T). \tag{7.5} \]

Conversely, if $(X,\mathsf d)$ is complete, $\mu_t = \rho_t\mathfrak m \in \mathrm{AC}^2([0,T];(\mathscr P(X),W_{\mathsf d}))$ and $\sup_{t \in [0,T]} \|\rho_t\|_\infty < \infty$, then $\rho_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ and

\[ \|\rho_t'\| \le |\dot\mu_t| \quad \text{for a.e. } t \in (0,T). \tag{7.6} \]


Proof. We show the second part of the statement and (7.6). Let $\mu_t = \rho_t\mathfrak m \in \mathrm{AC}^2([0,T];(\mathscr P(X),W_{\mathsf d}))$ with essentially bounded densities $\rho_t$.

The inequality (5.7) shows that $t \mapsto \int f \, d\mu_t$ is absolutely continuous in $[0,T]$ for all $f \in \mathrm{Lip}_b(X,\tau,\mathsf d)$. In addition, Corollary 6.4 provides the inequality

\[ \Big| \int f\rho_t \, d\mathfrak m - \int f\rho_s \, d\mathfrak m \Big| \le \int_s^t |\dot\mu_r| \Big( \int |\mathrm D f|_w^2 \rho_r \, d\mathfrak m \Big)^{1/2} dr \]

for $0 \le s \le t \le T$. By the density of $\mathrm{Lip}_b(X,\tau,\mathsf d)$ in $\mathcal A_{\mathrm{Ch}}$ provided by Proposition 6.3(b) the inequality extends to all $f \in \mathcal A_{\mathrm{Ch}}$.

We provide a proof of the converse implication and the converse inequality (7.5), along the lines of [32], assuming for simplicity $T = 1$. First we notice that the property $\rho_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ is stable under convolution w.r.t. the time parameter: more precisely, if we extend $t \mapsto \rho_t$ by continuity and with constant values to $(-\infty,0) \cup (1,\infty)$, then $\rho_{\varepsilon,t} := \rho_t * \chi_\varepsilon$ still belongs to $\mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ and $\|\rho_{\varepsilon,t}'\|^2 \le \|\rho_t'\|^2 * \chi_\varepsilon$. For this reason, in the proof of this implication we can assume with no loss of generality that $t \mapsto \rho_t$ is continuous w.r.t. the $L^1(X,\mathfrak m)$ topology. We start from the duality formula (5.2). Let $\phi : X \to [0,\infty)$ be such that $\phi \in C(K)$ and $\phi = \max_K \phi$ on $X \setminus K$, with $K \subset X$ compact and not empty. Under this restriction on $\phi$, we have already seen that $Q_\varepsilon\phi$ can be represented in the form (5.4), and that $Q_\varepsilon\phi$ is $\mathsf d$-Lipschitz, Borel (because it is $\tau$-lower semicontinuous), nonnegative and bounded. In addition $Q_\varepsilon\phi \uparrow \phi$ and $Q_1(Q_\varepsilon\phi) \uparrow Q_1\phi$ as $\varepsilon \downarrow 0$.

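Set now $\varphi := Q_\varepsilon\phi$ for some $\varepsilon > 0$ and observe that $Q_t\varphi$, $t \in [0,1]$, are uniformly $\mathsf d$-Lipschitz and that the map $t \mapsto Q_t\varphi$ is Lipschitz from $[0,1]$ with values in $L^\infty(X,\mathfrak m)$. By applying [2, Lem. 4.3.4] to the function $(s,t) \mapsto \int \rho_s Q_t\varphi \, d\mathfrak m$ we obtain that $t \mapsto \int \rho_t Q_t\varphi \, d\mathfrak m$ is absolutely continuous in $[0,1]$ and that its derivative can be estimated from above by

\[ \limsup_{s \to t} \frac{1}{|s-t|} \Big| \int (\rho_s - \rho_t) Q_t\varphi \, d\mathfrak m \Big| + \limsup_{s \to t} \int \rho_t \frac{Q_s\varphi - Q_t\varphi}{s-t} \, d\mathfrak m. \]

Using the inequality

\[ \Big| \int \rho_t Q_t\varphi \, d\mathfrak m - \int \rho_s Q_t\varphi \, d\mathfrak m \Big| \le \int_s^t \|\rho_r'\| \Big( \int |\mathrm D Q_t\varphi|_w^2 \rho_r \, d\mathfrak m \Big)^{1/2} dr \]

we estimate the first limsup, at Lebesgue points $t$ of $s \mapsto \|\rho_s'\|$, by

\[ \|\rho_t'\| \Big( \int |\mathrm D Q_t\varphi|_w^2 \rho_t \, d\mathfrak m \Big)^{1/2} \]

(here we used also the strong continuity of $s \mapsto \rho_s$). Estimating the second limsup with Fatou's lemma and using Proposition 6.3(d) gives

\[ \int (\rho_1 Q_1\varphi - \rho_0\varphi) \, d\mathfrak m \le \int_0^1 \Big[ \|\rho_t'\| \Big( \int (\mathrm{Lip}_a^*(Q_t\varphi,\cdot))^2 \rho_t \, d\mathfrak m \Big)^{1/2} + \int \rho_t\xi_t \, d\mathfrak m \Big] dt, \]

where $\xi_t$ is the bounded Borel function

\[ \xi_t := \limsup_{s \to t} \frac{Q_s\varphi - Q_t\varphi}{s-t}. \]

Now we use the pointwise subsolution property

\[ \frac12 \big( \mathrm{Lip}_a^*(Q_t\varphi,\cdot) \big)^2 \le -\xi_t \quad \mathfrak m\text{-a.e. in } X \tag{7.7} \]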
for a.e. $t \in (0,1)$ (whose proof follows as in [3, Thm. 3.5], where it is stated in a weaker form with the slope in place of the asymptotic Lipschitz constant, see also [6]) and the Young inequality to get $\int (\rho_1 Q_1\varphi - \rho_0\varphi) \, d\mathfrak m \le \frac12 \int_0^1 \|\rho_t'\|^2 \, dt$. Remembering that $\varphi = Q_\varepsilon\phi$, we can let $\varepsilon \downarrow 0$ and use the arbitrariness of $\phi$ to get

\[ W_{\mathsf d}^2(\rho_1\mathfrak m,\rho_0\mathfrak m) \le \int_0^1 \|\rho_t'\|^2 \, dt. \]

By applying this inequality to a rescaled version of $\rho$ we obtain $W_{\mathsf d}^2(\rho_t\mathfrak m,\rho_s\mathfrak m) \le (s-t) \int_t^s \|\rho_r'\|^2 \, dr$ for all $s, t \in [0,1]$ with $s > t$, so that by differentiation the inequality (7.5) follows at a.e. $t \in (0,1)$. □


Using the continuity inequality we can define an extended "Wasserstein-like" distance $W_{\mathrm{Ch}}$ in $\mathscr P^a(X)$ in the same spirit of the Benamou-Brenier formula:

\[ W_{\mathrm{Ch}}^2(\rho_0\mathfrak m,\rho_1\mathfrak m) := \inf \Big\{ \int_0^1 \|\rho_t'\|^2 \, dt : \rho_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m) \Big\}, \tag{7.8} \]

where the infimum is understood among curves joining $\rho_0$ to $\rho_1$. This definition is also natural in view of Remark 7.2. Even though it is conceptually convenient to think of $W_{\mathrm{Ch}}$ as an extended distance in $\mathscr P^a(X)$, we occasionally adopt the simpler notation $W_{\mathrm{Ch}}(\rho_0,\rho_1)$, i.e. we identify measures in $\mathscr P^a(X)$ with their densities w.r.t. $\mathfrak m$. The same remark applies to the other distances in $\mathscr P^a(X)$ we shall introduce. Now we provide a few basic properties of $W_{\mathrm{Ch}}$.
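
For orientation, the continuity inequality (7.4) and the definition (7.8) can be compared with the classical continuity equation. The following computation is only formal and rests on assumptions that are not part of the text: $X = \mathbb R^n$ with the Euclidean distance, $|\mathrm D f|_w = |\nabla f|$ for smooth bounded $f$, and a velocity field $v_t$ (a hypothetical datum) with finite action, solving the continuity equation in the weak form written in the first line.

```latex
% Assumed weak continuity equation, tested against smooth bounded f:
\frac{d}{dt}\int f\rho_t\,d\mathfrak m
   = \int \langle\nabla f, v_t\rangle\,\rho_t\,d\mathfrak m .
% Cauchy-Schwarz in L^2(\rho_t\mathfrak m), then integration on (s,t):
\Big|\int f\rho_t\,d\mathfrak m-\int f\rho_s\,d\mathfrak m\Big|
   \le \int_s^t \|v_r\|_{L^2(\rho_r\mathfrak m)}
       \Big(\int|\nabla f|^2\rho_r\,d\mathfrak m\Big)^{1/2}\,dr .
% Hence (7.4) holds on these test functions with
% c(r)=\|v_r\|_{L^2(\rho_r\mathfrak m)}, which formally suggests
\|\rho_t'\|\le\|v_t\|_{L^2(\rho_t\mathfrak m)},\qquad
W_{\mathrm{Ch}}^2(\rho_0\mathfrak m,\rho_1\mathfrak m)
   \le\int_0^1\!\!\int|v_t|^2\rho_t\,d\mathfrak m\,dt .
```

Under these assumptions the right hand side of the last inequality is the Benamou-Brenier action of the pair $(\rho_t, v_t)$, which is the sense in which (7.8) mimics that formula.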


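Proposition 7.4 (Properties of $W_{\mathrm{Ch}}$).

(a) $(\mathscr P^a(X),W_{\mathrm{Ch}})$ is an extended length metric space, and $W_{\mathrm{Ch}} \ge W_{\mathsf d}$.

(b) Assume that $\mu_t^n = \rho_t^n\mathfrak m \in \mathscr P(X)$ satisfy $\rho_t^n \to \rho_t$ weakly in $L^1(X,\mathfrak m)$ for all $t \in [0,T]$ and that $\rho^n \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ with $\|(\rho^n)_t'\|$ uniformly bounded in $L^2(0,T)$. Then, if $c(t)$ is any weak limit point of $\|(\rho^n)_t'\|$ as $n \to \infty$, one has $\rho_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ with $\|\rho_t'\| \le c(t)$ for a.e. $t \in (0,T)$.

(c) $W_{\mathrm{Ch}}^2$ is jointly convex in $(\mathscr P^a(X))^2$.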

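Proof. From (7.5) with $T = 1$ we obtain that $W_{\mathsf d}(\mu,\nu) \le W_{\mathrm{Ch}}(\mu,\nu)$ whenever $\mu, \nu \in \mathscr P^a(X)$. This yields immediately that $W_{\mathrm{Ch}}(\mu,\nu) = 0$ implies $\mu = \nu$. The proof of the triangle property of $W_{\mathrm{Ch}}$ follows by a standard concatenation argument, noticing that for any $T > 0$ one has

\[ W_{\mathrm{Ch}}(\rho_0,\rho_T) = \inf \Big\{ \int_0^T \|\rho_t'\| \, dt : \rho_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m) \Big\}. \tag{7.9} \]

The length property also follows directly from (7.9), while the proof of (b) is a direct consequence of a passage to the limit as $n \to \infty$ in (7.4).

In order to prove (c), notice that a convex combination of (7.4) written for $\rho_t, \sigma_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ gives

\[ \begin{aligned} &\Big| \int f((1-\alpha)\rho_t + \alpha\sigma_t) \, d\mathfrak m - \int f((1-\alpha)\rho_s + \alpha\sigma_s) \, d\mathfrak m \Big| \\ &\quad \le \int_s^t \Big[ (1-\alpha)\|\rho_r'\| \Big( \int |\mathrm D f|_w^2 \rho_r \, d\mathfrak m \Big)^{1/2} + \alpha\|\sigma_r'\| \Big( \int |\mathrm D f|_w^2 \sigma_r \, d\mathfrak m \Big)^{1/2} \Big] dr \\ &\quad \le \int_s^t \sqrt{(1-\alpha)\|\rho_r'\|^2 + \alpha\|\sigma_r'\|^2} \, \Big( \int |\mathrm D f|_w^2 ((1-\alpha)\rho_r + \alpha\sigma_r) \, d\mathfrak m \Big)^{1/2} dr \end{aligned} \]

for $0 \le s \le t \le T$. It follows that

\[ W_{\mathrm{Ch}}^2\big( (1-\alpha)\rho_1 + \alpha\sigma_1, (1-\alpha)\rho_0 + \alpha\sigma_0 \big) \le \int_0^1 \big( (1-\alpha)\|\rho_r'\|^2 + \alpha\|\sigma_r'\|^2 \big) \, dr \]

and, by minimizing, we conclude. □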

In the following corollary we reverse the inequality $W_{\mathsf d} \le W_{\mathrm{Ch}}$ on probability measures with density in $L^\infty(X,\mathfrak m)$, at the level of absolutely continuous curves and metric derivatives.

Corollary 7.5 (Equality of metric derivatives). Assume that $(X,\mathsf d)$ is complete and let $(\rho_t)_{t \in [0,T]}$ be a curve of probability densities with $\sup_{t \in [0,T]} \|\rho_t\|_\infty < \infty$. Then, for $\mu_t = \rho_t\mathfrak m$, one has

\[ \mu_t \in \mathrm{AC}^2([0,T];(\mathscr P(X),W_{\mathsf d})) \iff \mu_t \in \mathrm{AC}^2([0,T];(\mathscr P(X),W_{\mathrm{Ch}})) \]

and the corresponding metric derivatives coincide a.e. in $(0,T)$.

Proof. The implication $\Leftarrow$ is obvious, because $W_{\mathrm{Ch}} \ge W_{\mathsf d}$. In order to prove the converse one, first apply the second part of the statement of Theorem 7.3 (the converse implication, which uses completeness) to obtain $\rho_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ and $\|\rho_t'\| \le |\dot\mu_t| \in L^2(0,T)$. By the very definition of $W_{\mathrm{Ch}}$, this implies $\mu_t \in \mathrm{AC}^2([0,T];(\mathscr P(X),W_{\mathrm{Ch}}))$. The coincidence of the metric derivatives is a simple consequence of (7.5), (7.6). □


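7.2. A dual distance induced by subsolutions of the Hamilton-Jacobi equation. We close this section by introducing another "dual" extended distance $W_{\mathrm{Ch},*}$ in $\mathscr P^a(X)$, motivated by the analogy with the dual formulation of the optimal transport problem, the inequality $Q_1 f(x) - f(y) \le \frac12 \mathsf d^2(x,y)$ and the subsolution property (7.7) of $Q_t f$.

Definition 7.6. For $\mu_0 = \rho_0\mathfrak m$, $\mu_1 = \rho_1\mathfrak m \in \mathscr P^a(X)$ we define

\[ W_{\mathrm{Ch},*}^2(\mu_0,\mu_1) := 2 \sup_\phi \int (\phi_1\rho_1 - \phi_0\rho_0) \, d\mathfrak m, \tag{7.10} \]

where the supremum runs in the convex set of all the bounded Borel maps $\phi(t,x) = \phi_t(x)$ satisfying $\phi \in C_{w*}([0,1];L^\infty(X,\mathfrak m)) \cap L^1(0,1;D(\mathrm{Ch}))$, and

\[ \partial_t\phi_t + \frac12 |\mathrm D\phi_t|_w^2 \le 0 \quad \text{in } (0,1) \times X, \text{ in the duality with } \mathcal A_{\mathrm{Ch}}. \tag{7.11} \]

The inequality (7.11) has to be understood as

\[ \frac{d}{dt} \int \phi_t\psi \, d\mathfrak m + \frac12 \int \psi |\mathrm D\phi_t|_w^2 \, d\mathfrak m \le 0 \quad \text{in } \mathcal D'(0,1) \tag{7.12} \]

for all $\psi \in \mathcal A_{\mathrm{Ch}}$ nonnegative.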


Lemma 7.7 (Equivalent admissible class of subsolutions to (7.11)). The supremum in formula (7.10) can be equivalently taken w.r.t. subsolutions $\phi$ to (7.11) in the class $\phi \in C^\infty([0,1];\mathcal A_{\mathrm{Ch}})$.

Proof. By approximating any admissible $\phi$ in the definition of $W_{\mathrm{Ch},*}$ with the functions

\[ \phi_\lambda(t,x) := \lambda\phi(\lambda t + (1-\lambda)/2, x) \]

and by letting $\lambda \uparrow 1$, we see that it is not restrictive to assume the existence of $a < 0$ and $b > 1$ such that $\phi$ is bounded, $\phi \in C_{w*}([a,b];L^\infty(X,\mathfrak m)) \cap L^1(a,b;D(\mathrm{Ch}))$ and $\partial_t\phi_t + |\mathrm D\phi_t|_w^2/2 \le 0$ in $(a,b) \times X$ according to (7.12). Then, by mollification w.r.t. $t$, which preserves the Hamilton-Jacobi subsolution property, we can also assume that $\phi \in C^\infty([0,1];L^\infty(X,\mathfrak m)) \cap C^\infty([0,1];D(\mathrm{Ch}))$ with $\phi(\cdot,x) \in C^k([0,1])$, uniformly w.r.t. $x$. Under this assumption, the subsolution property is satisfied $\mathfrak m$-a.e. in $X$, for all $t$, which also shows that the map $t \mapsto |\mathrm D\phi_t|_w$ is uniformly bounded in $L^\infty(X,\mathfrak m)$. It follows that $\phi$ is uniformly bounded with values in $\mathcal A_{\mathrm{Ch}}$ and strongly measurable with respect to the Borel sets induced by the $L^2$-topology. A further convolution in time (or the mollification by a semigroup in the first step) shows that we can also assume $\phi \in C^k([0,1];\mathcal A_{\mathrm{Ch}})$.

□
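
The rescaling used at the beginning of the proof of Lemma 7.7 can be checked by a short formal computation. It is written here for $\phi$ differentiable in time, which is an assumption made only for this illustration (the proof works with the distributional formulation (7.12)), and it uses the homogeneity $|\mathrm D(\lambda f)|_w = \lambda|\mathrm D f|_w$ for $\lambda > 0$.

```latex
% Rescaled function, with \lambda\in(0,1) and \tau:=\lambda t+(1-\lambda)/2:
\phi_\lambda(t,x)=\lambda\,\phi(\tau,x),\qquad
\partial_t\phi_\lambda(t,x)=\lambda^2\,(\partial_t\phi)(\tau,x),\qquad
|\mathrm D\phi_\lambda(t,\cdot)|_w=\lambda\,|\mathrm D\phi_\tau|_w .
% The subsolution property is therefore preserved:
\partial_t\phi_\lambda+\tfrac12|\mathrm D\phi_\lambda|_w^2
  =\lambda^2\Big(\partial_t\phi+\tfrac12|\mathrm D\phi|_w^2\Big)(\tau,\cdot)\le 0 .
% \tau ranges in [0,1] exactly when t ranges in [a_\lambda,b_\lambda], where
a_\lambda=-\frac{1-\lambda}{2\lambda}<0,\qquad
b_\lambda=\frac{1+\lambda}{2\lambda}>1 .
```

This is why, after rescaling, the subsolution can be assumed to live on an interval $(a,b)$ strictly larger than $[0,1]$, leaving room for the mollification in time.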


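Remark 7.8 (Elementary properties of $W_{\mathrm{Ch},*}$).

(1) By the scaling argument $\tilde\phi(t,x) = \delta\phi(\delta t,x)$, it is easily seen that

\[ W_{\mathrm{Ch},*}^2(\rho_0,\rho_1) = 2 \sup \delta \int (\phi_\delta\rho_1 - \phi_0\rho_0) \, d\mathfrak m, \]

where the supremum runs among all pairs $(\delta,\phi)$ with $\delta > 0$ and $\phi$ a bounded Borel map $\phi(t,x) = \phi_t(x)$ satisfying $\phi \in C_{w*}([0,\delta];L^\infty(X,\mathfrak m)) \cap L^1(0,\delta;D(\mathrm{Ch}))$, and

\[ \partial_t\phi_t + \frac12 |\mathrm D\phi_t|_w^2 \le 0 \quad \text{in } (0,\delta) \times X, \text{ in the duality with } \mathcal A_{\mathrm{Ch}}. \]

(2) More generally, suppose that $\varphi \in C_{w*}([a,b];L^\infty(X,\mathfrak m)) \cap L^1(a,b;D(\mathrm{Ch}))$ satisfies

\[ \partial_t\varphi_t + \frac{\vartheta(t)}{2} |\mathrm D\varphi_t|_w^2 \le 0 \quad \text{in } (a,b) \times X, \text{ in the duality with } \mathcal A_{\mathrm{Ch}}, \tag{7.13} \]

where $\vartheta \in C([a,b])$ is a positive function. Then

\[ 2\alpha(b) \int (\rho_1\varphi_b - \rho_0\varphi_a) \, d\mathfrak m \le W_{\mathrm{Ch},*}^2(\rho_0,\rho_1) \quad \text{where } \alpha(t) := \int_a^t \vartheta(r) \, dr. \tag{7.14} \]

In fact, setting

\[ \beta(t) := \alpha(t)/\alpha(b), \quad t \in [a,b], \qquad \gamma(s) := \beta^{-1}(s), \]

so that $\gamma$ is an increasing diffeomorphism between $[0,1]$ and $[a,b]$ satisfying $\gamma'(s) = \alpha(b)/\vartheta(\gamma(s))$, the curve $\tilde\varphi_s := \alpha(b)\varphi_{\gamma(s)}$ solves

\[ \partial_s\tilde\varphi_s + \frac12 |\mathrm D\tilde\varphi_s|_w^2 \le 0 \quad \text{in } (0,1) \times X, \text{ in the duality with } \mathcal A_{\mathrm{Ch}}, \]

so that

\[ 2\alpha(b) \int (\rho_1\varphi_b - \rho_0\varphi_a) \, d\mathfrak m = 2 \int (\rho_1\tilde\varphi_1 - \rho_0\tilde\varphi_0) \, d\mathfrak m \le W_{\mathrm{Ch},*}^2(\rho_0,\rho_1). \]

(3) It is not hard to prove that $W_{\mathrm{Ch},*}$ is an extended distance: indeed, the non-degeneracy condition follows by the inequality $W_{\mathrm{Ch},*} \ge W_{\mathsf d}$, proved in the next proposition. The symmetry property follows easily by replacing $\phi(t,x)$ by $-\phi(1-t,x)$. In order to prove the triangle inequality, given probability densities $\rho$, $\sigma$, $\lambda$, and constants $\delta > 0$ and $\delta' \in (0,\delta)$ we write

\[ \begin{aligned} 2\delta \int (\phi_\delta\lambda - \phi_0\rho) \, d\mathfrak m &= 2\delta \int (\phi_\delta\lambda - \phi_{\delta'}\sigma) \, d\mathfrak m + 2\delta \int (\phi_{\delta'}\sigma - \phi_0\rho) \, d\mathfrak m \\ &\le \frac{2\delta}{2(\delta - \delta')} W_{\mathrm{Ch},*}^2(\lambda,\sigma) + \frac{2\delta}{2\delta'} W_{\mathrm{Ch},*}^2(\sigma,\rho). \end{aligned} \]

Now we minimize w.r.t. $\delta'$ and use the identity $\inf_{s \in (0,1)} s^{-1}a^2 + (1-s)^{-1}b^2 = (a+b)^2$ to get

\[ 2\delta \int (\phi_\delta\lambda - \phi_0\rho) \, d\mathfrak m \le \big( W_{\mathrm{Ch},*}(\lambda,\sigma) + W_{\mathrm{Ch},*}(\sigma,\rho) \big)^2. \]

By taking the supremum w.r.t. $(\delta,\phi)$ we conclude.

(4) $W_{\mathrm{Ch},*}^2$ is jointly convex in $\mathscr P^a(X) \times \mathscr P^a(X)$ and l.s.c. with respect to the weak topology of $L^1(X,\mathfrak m)$, since it is defined as a supremum of a family of continuous linear functionals on $L^1(X,\mathfrak m)$. In particular, every closed sublevel of the Entropy functional (2.12) in $\mathscr P^a(X)$ is complete with respect to $W_{\mathrm{Ch},*}$.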
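
The elementary identity used in item (3) for the triangle inequality, $\inf_{s \in (0,1)} s^{-1}a^2 + (1-s)^{-1}b^2 = (a+b)^2$, is attained at $s = a/(a+b)$ when $a, b > 0$; in the proof $s$ corresponds to the ratio $\delta'/\delta$. The following sketch checks it numerically on sample values chosen only for illustration (the function names are hypothetical).

```python
def split_cost(a, b, s):
    """Value of a^2/s + b^2/(1-s) for s in (0, 1)."""
    return a * a / s + b * b / (1 - s)

def best_split(a, b):
    """Minimiser s = a/(a+b) of split_cost, for a, b > 0, and the minimum."""
    s = a / (a + b)
    return s, split_cost(a, b, s)

a, b = 1.5, 2.5
s_star, value = best_split(a, b)
# Brute-force comparison on a uniform grid of (0, 1).
grid_min = min(split_cost(a, b, k / 1000) for k in range(1, 1000))
```

At the minimiser the two terms equal $a(a+b)$ and $b(a+b)$, whose sum is $(a+b)^2$; any other choice of $s$ gives a larger value.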


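We can now refine the inequality between $W_{\mathrm{Ch}}$ and $W_{\mathsf d}$, proving that $W_{\mathrm{Ch},*}$ is intermediate.

Proposition 7.9 (Comparison of $W_{\mathrm{Ch}}$, $W_{\mathrm{Ch},*}$ and $W_{\mathsf d}$). $W_{\mathsf d} \le W_{\mathrm{Ch},*} \le W_{\mathrm{Ch}}$ on $(\mathscr P^a(X))^2$.

Proof. We first prove that $W_{\mathrm{Ch}} \ge W_{\mathrm{Ch},*}$. By Lemma 7.7 we can assume that $\phi \in C^1([0,1];\mathcal A_{\mathrm{Ch}})$ with $\phi(\cdot,x) \in C^1([0,1])$, uniformly w.r.t. $x$. Under this assumption, the subsolution property is satisfied $\mathfrak m$-a.e. in $X$, for all $t$; in addition, for all $\rho_t \in \mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$, the Leibniz rule and a density argument easily give that $t \mapsto \int \phi_t\rho_t \, d\mathfrak m$ is absolutely continuous in $[0,1]$, and that

\[ \frac{d}{dt} \int \phi_t\rho_t \, d\mathfrak m \le \int \rho_t \frac{d}{dt}\phi_t \, d\mathfrak m + \|\rho_t'\| \Big( \int |\mathrm D\phi_t|_w^2 \rho_t \, d\mathfrak m \Big)^{1/2} \]

for a.e. $t \in (0,1)$. By the subsolution property and the Young inequality, it follows that

\[ \frac{d}{dt} \int \phi_t\rho_t \, d\mathfrak m \le \frac12 \|\rho_t'\|^2 \quad \text{for a.e. } t \in (0,1). \]

By integration in $(0,1)$ and by minimizing w.r.t. $\rho_t$ the inequality follows.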

Now we prove that $W_{\mathrm{Ch},*} \ge W_{\mathsf d}$. Let $(\mathsf d_i)$ be an increasing net of bounded and $(\tau \times \tau)$-continuous semidistances with $\mathsf d_i \uparrow \mathsf d$. Taking Theorem 5.1 into account, it is sufficient to fix $i$ and prove that $W_{\mathsf d_i} \le W_{\mathrm{Ch},*}$. In order to prove this property, taking (5.3) and the comments immediately after into account, it suffices to show that

\[ Q_t^i\phi(x) := \inf_{y \in X} \phi(y) + \frac{1}{2t} \mathsf d_i^2(x,y) \]

is admissible in (7.10) whenever $\phi$ is bounded and $\mathsf d_i$-Lipschitz (thus $\tau$-continuous). This follows combining the subsolution property (see (7.7) and the comments after it)

\[ \limsup_{s \to t} \frac{Q_s^i\phi(x) - Q_t^i\phi(x)}{s-t} + \frac12 \big( \mathrm{Lip}_{a,\mathsf d_i}(Q_t^i\phi,x) \big)^2 \le 0 \quad \mathfrak m\text{-a.e. in } X, \]

satisfied by $Q_t^i\phi$ for a.e. $t > 0$, with the inequalities

\[ |\mathrm D Q_t^i\phi|_w(x) \le \mathrm{Lip}_{a,\mathsf d}^*(Q_t^i\phi,x) \le \mathrm{Lip}_{a,\mathsf d_i}^*(Q_t^i\phi,x) \quad \mathfrak m\text{-a.e. in } X. \]

□

Remark 7.10. One can also introduce the "dual" $L^1$ transport distance $W_{\mathrm{Ch},*,1}$:

\[ W_{\mathrm{Ch},*,1}(\rho_0,\rho_1) := \sup \int \phi(\rho_1 - \rho_0) \, d\mathfrak m, \tag{7.15} \]

where the supremum runs over all bounded and Borel maps $\phi \in D(\mathrm{Ch})$ with $|\mathrm D\phi|_w \le 1$ $\mathfrak m$-a.e. in $X$. It is not hard to see that

\[ W_{\mathrm{Ch},*,1}(\rho_0,\rho_1) \le W_{\mathrm{Ch},*}(\rho_0,\rho_1). \]

Indeed, fix $\phi$ with $|\mathrm D\phi|_w \le 1$ and put $\phi_t(x) = -\frac{t}{2} + \phi(x)$, which is admissible in the definition of $W_{\mathrm{Ch},*}$. Now for $\delta > 0$ we have

\[ \int \phi(\rho_1 - \rho_0) \, d\mathfrak m = \int (\phi_\delta\rho_1 - \phi_0\rho_0) \, d\mathfrak m + \frac{\delta}{2} \le \frac{1}{2\delta} W_{\mathrm{Ch},*}^2(\rho_0,\rho_1) + \frac{\delta}{2}. \]

Optimizing in $\delta$ we find $\int \phi(\rho_1 - \rho_0) \, d\mathfrak m \le W_{\mathrm{Ch},*}(\rho_0,\rho_1)$ and taking the supremum over $\phi$ yields the claim.

8. Identification of gradient flows 

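In this section we compare the metric gradient flows of $\mathrm{Ent}$ w.r.t. the extended distances $W_{\mathsf d}$ and $W_{\mathrm{Ch}}$, relating them also to the $L^2(X,\mathfrak m)$ gradient flow $\mathsf P_t$ of $\tfrac12\mathrm{Ch}$.

The following result is a small improvement of [3, Thm. 7.4], since we replace the slope of $\mathrm{Ent}$ w.r.t. $W_{\mathsf d}$ with the slope w.r.t. the (a priori larger) distance $W_{\mathrm{Ch}}$. It can be obtained with the same proof.

Lemma 8.1 (The Fisher information is dominated by the slope of the Entropy). For every probability density $\rho \in L^1_+(X,\mathfrak m)$ one has

\[ 4 \int |\mathrm D\sqrt\rho|_w^2 \, d\mathfrak m \le |\mathrm D^-_{W_{\mathrm{Ch}}}\mathrm{Ent}|^2(\rho\mathfrak m). \tag{8.1} \]

Proof. Let $\rho_t = \mathsf P_t\rho$; we set $\mu_t := \rho_t\mathfrak m$ and $\mu = \rho\mathfrak m$. Denoting by $|\dot\mu_t|$ the metric derivative w.r.t. $W_{\mathrm{Ch}}$, from Remark 7.2 we get

\[ |\dot\mu_t|^2 \le \mathsf F(\rho_t) \quad \text{for a.e. } t > 0. \tag{8.2} \]

Applying (6.5) we get

\[ \begin{aligned} \mathrm{Ent}(\mu) - \mathrm{Ent}(\mu_t) &= \int_0^t \mathsf F(\rho_s) \, ds \ge \frac12 \int_0^t \mathsf F(\rho_s) \, ds + \frac12 \int_0^t |\dot\mu_s|^2 \, ds \\ &\ge \frac12 \Big( \frac{1}{\sqrt t} \int_0^t \sqrt{\mathsf F(\rho_s)} \, ds \Big)^2 + \frac12 \Big( \frac{1}{\sqrt t} \int_0^t |\dot\mu_s| \, ds \Big)^2 \ge \frac1t \Big( \int_0^t \sqrt{\mathsf F(\rho_s)} \, ds \Big) W_{\mathrm{Ch}}(\mu,\mu_t). \end{aligned} \tag{8.3} \]

Dividing by $W_{\mathrm{Ch}}(\mu,\mu_t)$ and passing to the limit as $t \downarrow 0$ we get (8.1), since the lower semicontinuity of $\mathrm{Ch}$ yields

\[ \sqrt{\mathsf F(\rho)} \le \liminf_{t \downarrow 0} \frac1t \int_0^t \sqrt{\mathsf F(\rho_s)} \, ds. \]

□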

In order to identify the metric gradient flows of $\mathrm{Ent}$ with $\mathsf P_t$, we will also use the following result, see [3, Lem. 5.17, Thm. 8.1]. Its proof uses Proposition 5.5, Proposition 6.3 (e), (g), the estimates (6.4), (6.5), the convexity of $\mathsf F$ and the strict convexity of $\mathrm{Ent}$; see also the next section for an analogous argument involving the same ingredients.

Theorem 8.2. Let $(\rho_t)_{t \in [0,T]}$ be a curve of bounded probability densities with $\sup_t \|\rho_t\|_\infty < \infty$. Assume that $\mu_t = \rho_t\mathfrak m \in \mathrm{AC}^2([0,T];(\mathscr P(X),W_{\mathsf d}))$ and that $\mu_t$ satisfies the Entropy-Fisher dissipation inequality

\[ \mathrm{Ent}(\mu_0) \ge \mathrm{Ent}(\mu_T) + \frac12 \int_0^T |\dot\mu_t|^2 \, dt + \frac12 \int_0^T \mathsf F(\rho_t) \, dt. \tag{8.4} \]

Then $\rho_t = \mathsf P_t\rho_0$ for all $t \in [0,T]$ and equality holds in (8.4).

Theorem 8.3 (Identification of gradient flows). Let $(X,\tau,\mathsf d,\mathfrak m)$ be an extended metric measure space with $(X,\mathsf d)$ complete. Let $(\rho_t)_{t \in [0,\infty)}$ be a curve of probability densities with $\|\rho_t\|_\infty \in L^\infty_{\mathrm{loc}}([0,\infty))$, set $\mu_t = \rho_t\mathfrak m$ and let us consider the properties

(a) $\mu_t$ is a metric gradient curve of $\mathrm{Ent}$ relative to $W_{\mathsf d}$ starting from $\mu_0$;

(b) $\mu_t$ is a metric gradient curve of $\mathrm{Ent}$ relative to $W_{\mathrm{Ch}}$ starting from $\mu_0$;

(c) $\rho_t = \mathsf P_t\rho_0$ for all $t \in [0,\infty)$.

Then (a) $\Rightarrow$ (b) $\Rightarrow$ (c). If $|\mathrm D^-_{W_{\mathsf d}}\mathrm{Ent}|$ is lower semicontinuous in $L^1_+(X,\mathfrak m)$, then (c) $\Rightarrow$ (a).

Proof. By Corollary 7.5 and the inequality $W_{\mathsf d} \le W_{\mathrm{Ch}}$, which yields a converse inequality at the level of slopes, the metric gradient curves in (a) are contained in the metric gradient curves in (b). On the other hand, by (8.1) of Lemma 8.1, the metric gradient curves in (b) satisfy the Entropy-Fisher dissipation inequality and therefore, thanks to Theorem 8.2, satisfy (c). Finally, under the lower semicontinuity assumption on $|\mathrm D^-_{W_{\mathsf d}}\mathrm{Ent}|$, the identity

\[ \mathrm{Ent}(\rho\mathfrak m) - \mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m) = \int_0^t \mathsf F(\mathsf P_s\rho) \, ds \]

and the inequality $\mathsf F(\rho) \ge |\mathrm D^-_{W_{\mathsf d}}\mathrm{Ent}|^2(\rho\mathfrak m)$ (see [3, Thm. 7.6]) show that the class (c) is contained in the class (a). □

Remark 8.4. By comparison, the implications above can also be stated with the distance $W_{\mathrm{Ch},*}$. This is possible because, according to Proposition 7.9, $W_{\mathrm{Ch},*}$ is intermediate between $W_{\mathrm{Ch}}$ and $W_{\mathsf d}$.


9. A stability result for Cheeger's energies

In this section we consider an extended metric-topological space $(X,\tau,\mathsf d)$ and a monotone family of $(\tau \times \tau)$-continuous approximating semidistances $\mathsf d_i : X \times X \to [0,\infty)$ as in Definition 4.1. Given $\mathfrak m \in \mathscr P(X)$, our goal is to prove a convergence result for the corresponding Cheeger energies. Since in view of the applications we have in mind we want to cover also the case when $\mathsf d_i$ are semidistances, we have to adapt the construction of Section 6, designed for (extended) distances, to this slightly more general setting.

Let $(X_i,\mathsf d_i)$ be the quotient metric space, with $\pi^i : X \to X_i$ the canonical projection. We choose in $X_i$ the standard topology $\tau_i$ generated by the metric structure, so that $(X_i,\tau_i,\mathsf d_i)$ is a standard metric-topological space and $\pi^i : X \to X_i$ is continuous (thanks to the $(\tau \times \tau)$-continuity of $\mathsf d_i$). Thanks to the continuity of $\pi^i$ we can also define $\mathfrak m_i = (\pi^i)_\sharp\mathfrak m \in \mathscr P(X_i)$, thus providing the structure of metric measure space to $X_i$.

The map $g \mapsto \pi^i_*(g) = g \circ \pi^i$ provides a linear isometry of $L^2(X_i,\mathfrak m_i)$ into $L^2(X,\mathfrak m)$. Then, denoting by $\mathcal D_i$ the closure of $\mathrm{Lip}_b(X,\tau,\mathsf d_i)$ in $L^2(X,\mathfrak m)$, we notice that $\mathcal D_i \subset \pi^i_*(L^2(X_i,\mathfrak m_i))$, because any function in $\mathrm{Lip}_b(X,\tau,\mathsf d_i)$ belongs to the image of $\pi^i_*$.

Denoting by $\widetilde{\mathrm{Ch}}_i$ and $\widetilde{\mathsf P}{}^i_t$ the Cheeger energy and its gradient flow in $(X_i,\tau_i,\mathsf d_i,\mathfrak m_i)$, the formulas

\[ \mathrm{Ch}_i(f) := \begin{cases} \widetilde{\mathrm{Ch}}_i(g) & \text{if } f = g \circ \pi^i \in \pi^i_*(L^2(X_i,\mathfrak m_i)), \\ +\infty & \text{otherwise}, \end{cases} \qquad \mathsf P^i_t f := (\widetilde{\mathsf P}{}^i_t g) \circ \pi^i \ \text{ if } f = g \circ \pi^i \in \pi^i_*(L^2(X_i,\mathfrak m_i)),\ t \ge 0, \tag{9.1} \]

enable us to lift the Cheeger energy $\widetilde{\mathrm{Ch}}_i$ and its gradient flow $\widetilde{\mathsf P}{}^i_t$ to the subspaces $\pi^i_*(L^2(X_i,\mathfrak m_i))$ of $L^2(X,\mathfrak m)$, retaining the metric gradient flow property. Since $\widetilde{\mathrm{Ch}}_i$ have a dense domain in $L^2(X_i,\mathfrak m_i)$, it follows that the closure of the domain of $\mathrm{Ch}_i$, namely $\pi^i_*(L^2(X_i,\mathfrak m_i))$, contains $\mathcal D_i$, so that Lemma 4.5 gives

\[ \bigcup_{i \in I} D(\mathrm{Ch}_i) \text{ is dense in } L^2(X,\mathfrak m). \tag{9.2} \]

The proof of the following theorem is inspired by various stability results based on $\Gamma$-convergence and on the energy dissipation point of view, see for instance [40], [24] and [25]. At the level of $\mathrm{Ch}_i$, the only properties that will play a role are (9.2) and the energy dissipation inequality (9.4). The latter easily follows from the corresponding properties of $\widetilde{\mathrm{Ch}}_i$, $\widetilde{\mathsf P}{}^i$.

Theorem 9.1. Under the previous assumptions on $\mathsf d_i$ one has that $\mathrm{Ch}$ coincides with the largest $L^2(X,\mathfrak m)$ lower semicontinuous functional smaller than $\inf_i \mathrm{Ch}_i$.

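Proof. Let $L_*$ be the largest $L^2(X,\mathfrak m)$ lower semicontinuous functional smaller than $L := \inf_i \mathrm{Ch}_i$. Since $\mathsf d_i \le \mathsf d$, from the inequality

\[ \int |\mathrm D(g \circ \pi^i)|_w^2 \, d\mathfrak m \le \int |\mathrm D(g \circ \pi^i)|_{w,\mathsf d_i}^2 \, d\mathfrak m = \int |\mathrm D g|_{w,\mathsf d_i}^2 \, d\mathfrak m_i, \tag{9.3} \]

we immediately get $\mathrm{Ch} \le \mathrm{Ch}_i$, hence $\mathrm{Ch} \le L$ and the lower semicontinuity of $\mathrm{Ch}$ gives $\mathrm{Ch} \le L_*$. In order to prove the converse inequality, we fix a probability density $\rho$ with $C \ge \rho \ge c > 0$ $\mathfrak m$-a.e. in $X$ and denote by $\rho^i_t$ the gradient curves of $\mathrm{Ch}_i$ starting from $\rho^i_0$, the $L^2(X,\mathfrak m)$ projection of $\rho$ on the closure of $D(\mathrm{Ch}_i)$. By (9.2) we know that $\rho^i_0 \to \rho$ in $L^2(X,\mathfrak m)$ and the stability of $D(\mathrm{Ch}_i)$ under truncations immediately gives $C \ge \rho^i_t \ge c$ $\mathfrak m$-a.e. in $X$; in addition, using the regularization estimate (3.7) it is easily seen that

\[ \limsup_{i \in I} \mathrm{Lip}(\rho^i,(\delta,\infty)) < \infty \quad \forall \delta > 0 \]

(where the Lipschitz constant is computed w.r.t. the $L^2(X,\mathfrak m)$ norm). Hence, we may find a subnet $\beta : J \to I$ and a curve $\rho_t$ such that $\lim_{j \in J} \rho^{\beta(j)}_t = \rho_t$ in the weak topology of $L^2(X,\mathfrak m)$ for all $t \ge 0$, with $\rho : (0,\infty) \to L^2(X,\mathfrak m)$ continuous.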

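Our goal is to pass to the limit first w.r.t. $j$ and then as $t \downarrow 0$ in the energy dissipation inequalities

\[ \mathrm{Ent}(\mu^{\beta(j)}_t) + \frac12 \int_0^t \Big( |\dot\mu^{\beta(j)}_s|^2 + \mathsf F_{\beta(j)}(\rho^{\beta(j)}_s) \Big) ds \le \mathrm{Ent}(\rho^{\beta(j)}_0\mathfrak m), \tag{9.4} \]

with $\mu^{\beta(j)}_t = \rho^{\beta(j)}_t\mathfrak m$, $|\dot\mu^{\beta(j)}_s|$ equal to the metric derivative of the curve $\mu^{\beta(j)}$ w.r.t. $W_{\mathsf d_{\beta(j)}}$ and $\mathsf F_{\beta(j)}$ the Fisher information functionals associated to $\mathrm{Ch}_{\beta(j)}$, to prove that $\rho_t$ is the gradient curve of $\mathrm{Ch}$ starting from $\rho$.

We first notice that the representation (3.2) of the action as a supremum, together with the monotone convergence $\lim_j W_{\mathsf d_{\beta(j)}} = W_{\mathsf d}$, imply that $\mu_s \in \mathrm{AC}^2([0,t];(\mathscr P(X),W_{\mathsf d}))$ and that

\[ \liminf_{j \in J} \int_0^t |\dot\mu^{\beta(j)}_s|^2 \, ds \ge \int_0^t |\dot\mu_s|^2 \, ds, \tag{9.5} \]

where $\mu_t = \rho_t\mathfrak m$ and $|\dot\mu_t|$ denotes the metric derivative w.r.t. $W_{\mathsf d}$.

Let us denote by $\mathsf F$ the Fisher information functional associated to $\mathrm{Ch}$ and notice that $\mathsf F_i \ge \mathsf F$. Hence, combining (9.5) with (9.4) and with $\liminf_{j \in J} \mathrm{Ent}(\mu^{\beta(j)}_t) \ge \mathrm{Ent}(\mu_t)$ we get

\[ \mathrm{Ent}(\mu_t) + \frac12 \int_0^t \big( |\dot\mu_s|^2 + \mathsf F(\rho_s) \big) \, ds \le \mathrm{Ent}(\rho\mathfrak m). \tag{9.6} \]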
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Since t is arbitrary this inequality, according to Theorem 8.2, proves that pt = P tp, where 
P t is the L 2 (X, m)-gradient flow of Ch; in addition, we can still use Theorem 8.2 to obtain 
that equality holds in (9.6). Therefore we obtain from this limiting argument the additional 
informations 

liminf Ent(pf^m) = Ent(pttn) , liminf f ds = [ \fi s \ 2 ds , (9.7) 


ieJ 


ieJ 


(that we shall exploit in the next theorem) as well as 

liminf / F^(pf^) ds = / F(p a ) ds . 
7gJ Jo Jo 


If we assume 


1 [* 

lim sup - F(/ 9 s ) ds < F (p) 

40 * Jo 


we can find, thanks to the convexity of t(j) —>• 0 such that the functions 


rt(j) 


Vi := 


t(j) 


ds 


satisfy lim inf j F < F(p), so that 


liminf Ch p(j){y/vj) ds < Ch {yfp) . 
7G J 


(9.8) 

(9.9) 


(9.10) 


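In order to prove that this implies $L_*(\sqrt\rho)\le\mathrm{Ch}(\sqrt\rho)$ it is sufficient to show that $\sqrt{\nu_j}\to\sqrt\rho$ in $L^2(X,\mathfrak m)$. This can be proved as follows: since $W_{\mathsf d_{\beta(j)}}(\nu_j,\rho)\to0$, we obtain

\[ \lim_{j\in J}\int f\nu_j\,d\mathfrak m=\int f\rho\,d\mathfrak m\qquad\text{for all } f\in\bigcup_{i\in I}\mathrm{Lip}_b(X,\tau,\mathsf d_i). \]

Hence, by $w^*$-compactness of closed balls in $L^\infty(X,\mathfrak m)$ and density of $\bigcup_i\mathrm{Lip}_b(X,\tau,\mathsf d_i)$, $\nu_j\to\rho$ weakly* in $L^\infty(X,\mathfrak m)$. Then, the entropy bound $\mathrm{Ent}(\nu_j\mathfrak m)\le\mathrm{Ent}(\rho\,\mathfrak m)$ implies convergence of $\nu_j$ in $\mathfrak m$-measure.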

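Now we remove the assumption (9.9). Given a probability density $\rho\in D(\mathrm{Ch})$ with $C\ge\rho\ge c>0$ $\mathfrak m$-a.e. in $X$ we obtain, by the previous step applied to $\rho_t=\mathsf P_t\rho$, the inequality $\mathrm{Ch}(\sqrt{\mathsf P_t\rho})\ge L_*(\sqrt{\mathsf P_t\rho})$ for a.e. $t>0$. By the chain rule, since $\mathrm{Ch}(\mathsf P_t\rho)\to\mathrm{Ch}(\rho)$ as $t\downarrow0$ implies $|\mathrm D\mathsf P_t\rho|_w\to|\mathrm D\rho|_w$ in $L^2(X,\mathfrak m)$, we obtain $\mathrm{Ch}(\sqrt{\mathsf P_t\rho})\to\mathrm{Ch}(\sqrt\rho)$ as $t\downarrow0$ and therefore $\mathrm{Ch}(\sqrt\rho)\ge L_*(\sqrt\rho)$.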

This proves the inequality $\mathrm{Ch}\ge L_*$ on all bounded and normalized functions $\rho$ with $\inf\rho>0$. Finally, we can extend by standard approximation arguments the inequality first to all bounded functions (by homogeneity and translation invariance) and eventually to all functions in $L^2(X,\mathfrak m)$. □


A byproduct of the previous proof and of the identification of gradient flows is the following stability result for gradient flows of Cheeger's energies; the stability proof also provides a crucial regularity property of Cheeger's energies that we call, as in [5], $\tau$-upper regularity (see also Definition 12.4). We use the same notation as in the statement of Theorem 9.1 and the notation

\[ |\mathrm D(g\circ\pi^i)|_{w,\mathsf d_i}:=|\mathrm Dg|_{w,\mathsf d_i}\circ\pi^i,\qquad g\in D(\mathrm{Ch}_i), \]

consistent with the definition (9.1) of $\mathrm{Ch}_i$. It is not difficult to show, along the lines of (9.3), that $|\mathrm Df|_{w,\mathsf d_i}\ge|\mathrm Df|_{w,\mathsf d}$ $\mathfrak m$-a.e. in $X$ for all $f\in D(\mathrm{Ch}_i)$.

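Theorem 9.2 (Stability of gradient flows and $\tau$-upper regularity of Cheeger energies). Under the same assumptions of Theorem 9.1, let $\rho_0\in L^\infty_+(X,\mathfrak m)$ and let $\rho^i_t$ (resp. $\rho_t$) be the $L^2$ gradient curves of $\mathrm{Ch}_i$ (resp. $\mathrm{Ch}$) starting from $\rho^i_0$, the $L^2(X,\mathfrak m)$ projection of $\rho_0$ on $D(\mathrm{Ch}_i)$ (resp. from $\rho_0$). Then $\rho^i_t\to\rho_t$ strongly in $L^2(X,\mathfrak m)$ for all $t>0$.

In addition, for all $f\in D(\mathrm{Ch})$ there exist a subnet $\beta:J\to I$ and bounded, $\mathsf d_{\beta(j)}$-Lipschitz functions $f_j$ with $f_j\to f$ in $L^2(X,\mathfrak m)$ and $\mathrm{Lip}_a(f_j,\mathsf d_{\beta(j)},\cdot)\to|\mathrm Df|_w$ in $L^2(X,\mathfrak m)$.

Proof. The weak convergence of $\rho^i_t$ to $\rho_t$ in $L^\infty(X,\mathfrak m)$ has already been achieved in the proof of the previous theorem. To show that the convergence is actually strong, we use the first equality in (9.7), which can be improved to

\[ \lim_{i\in I}\mathrm{Ent}(\rho^i_t\mathfrak m)=\mathrm{Ent}(\rho_t\mathfrak m) \]

since it can be applied to an arbitrary subnet.

Also the last statement can be obtained with a small refinement of the proof of Theorem 9.1: it suffices to start from (9.10) and then to proceed as in the rest of the proof to obtain $k_j\in D(\mathrm{Ch}_{\beta(j)})$ with $k_j\to f$ in $L^2(X,\mathfrak m)$ and $\liminf_j\mathrm{Ch}_{\beta(j)}(k_j)\le\mathrm{Ch}(f)$. Now, the inequality $|\mathrm Dk_j|_{w,\mathsf d_{\beta(j)}}\ge|\mathrm Dk_j|_{w,\mathsf d}$ $\mathfrak m$-a.e. in $X$ gives

\[ \int\big(|\mathrm Dk_j|_{w,\mathsf d_{\beta(j)}}-|\mathrm Df|_{w,\mathsf d}\big)^2\,d\mathfrak m\le2\,\mathrm{Ch}_{\beta(j)}(k_j)+2\,\mathrm{Ch}(f)-2\int|\mathrm Dk_j|_{w,\mathsf d}\,|\mathrm Df|_{w,\mathsf d}\,d\mathfrak m. \]

Hence, $|\mathrm Dk_j|_{w,\mathsf d_{\beta(j)}}\to|\mathrm Df|_{w,\mathsf d}$ in $L^2(X,\mathfrak m)$ along a further subnet. Finally, writing $k_j=\tilde k_j\circ\pi^{\beta(j)}$, by applying Proposition 6.3(b) to $\tilde k_j$ we can find $\tilde f_j\in\mathrm{Lip}_b(X_{\beta(j)},\mathsf d_{\beta(j)})$ with

\[ \lim_{j\in J}\int\Big|\,|\mathrm D\tilde k_j|_{w,\mathsf d_{\beta(j)}}-\mathrm{Lip}_a(\tilde f_j,\cdot)\Big|^2\,d\mathfrak m_{\beta(j)}=0. \]

Setting $f_j=\tilde f_j\circ\pi^{\beta(j)}$ we obtain the final part of the statement. □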

10. Energy measure spaces 

In this section we study a class of extended distances in the framework of Dirichlet forms; basic references on this topic are [15], [23].

10.1. Dirichlet forms, energy measure spaces and the Bakry-Émery condition.
Definition 10.1 (Energy measure space). We say that $(X,\mathcal B,\mathcal E,\mathfrak m)$ is an energy measure space if:

(a) $\mathcal B$ is a $\sigma$-algebra in $X$ and $\mathfrak m:\mathcal B\to[0,1]$ is a probability measure;

(b) $\mathcal E$ is a strongly local and Markovian Dirichlet form in $L^2(X,\mathfrak m)=L^2(X,\mathcal B,\mathfrak m)$ whose domain

\[ \mathbb V:=\{f\in L^2(X,\mathfrak m):\ \mathcal E(f,f)<\infty\} \]

is dense in $L^2(X,\mathfrak m)$;

(c) $\mathcal E$ admits a carré du champ operator defined on $\mathbb V\cap L^\infty(X,\mathfrak m)$.

Recall that the Markovian property means $\mathcal E(\phi\circ f,\phi\circ f)\le\mathcal E(f,f)$ for all $f\in\mathbb V$ and all 1-Lipschitz $\phi:\mathbb R\to\mathbb R$. We recall that the carré du champ operator is the bilinear form $\Gamma:(\mathbb V\cap L^\infty(X,\mathfrak m))^2\to L^1(X,\mathfrak m)$ providing a local representation of $\mathcal E$. When $\Gamma$ exists (in more general situations it has to be understood as a measure-valued operator), it is characterized by the identity

\[ \int\Gamma(f,f)\,\varphi\,d\mathfrak m=-\frac12\mathcal E(f^2,\varphi)+\mathcal E(f,f\varphi)\qquad\forall f,\varphi\in\mathbb V\cap L^\infty(X,\mathfrak m). \tag{10.1} \]

We use the standard abbreviations $\mathcal E(f)$, $\Gamma(f)$ for $\mathcal E(f,f)$, $\Gamma(f,f)$, respectively, in the sequel. The domain $\mathbb V$ of $\mathcal E$ is endowed with the Hilbert norm

\[ \|f\|^2_{\mathbb V}=\|f\|^2_2+\mathcal E(f) \tag{10.2} \]

and we denote by $\mathbb A_{\mathcal E}$ the Banach algebra $\{f\in\mathbb V:\ f,\ \Gamma(f)\in L^\infty(X,\mathfrak m)\}$ endowed with the norm (see also (7.3))

\[ \|f\|_{\mathbb A_{\mathcal E}}:=\|f\|_\infty+\big\|\Gamma(f)^{1/2}\big\|_\infty. \tag{10.3} \]

We now recall the main properties of the heat flow $\mathsf P^{\mathcal E}=(\mathsf P_t)_{t\ge0}$ associated to $\mathcal E$. It can be defined as the unique locally absolutely continuous (in fact analytic) map $t\in(0,\infty)\mapsto f_t\in L^2(X,\mathfrak m)$ satisfying

\[ \frac{d}{dt}f_t=\Delta_{\mathcal E}f_t\quad\text{for a.e. } t\in(0,\infty),\qquad \lim_{t\downarrow0}f_t=f\ \text{ in } L^2(X,\mathfrak m), \]

where $\Delta_{\mathcal E}$, the infinitesimal generator of the semigroup, is related to $\mathcal E$ by

\[ v=\Delta_{\mathcal E}f\iff f\in\mathbb V,\ v\in L^2(X,\mathfrak m),\ \int vg\,d\mathfrak m=-\mathcal E(f,g)\quad\forall g\in\mathbb V. \]

Using the characterization

\[ v=\Delta_{\mathcal E}f\iff-v\in\partial^-\tfrac12\mathcal E(f) \]

it is easy to check that $\mathsf P_t$ is also the metric gradient flow of $\tfrac12\mathcal E$ with respect to the $L^2(X,\mathfrak m)$ distance, according to (EDI) and (EVI$_K$) with $K=0$.

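We recall now a few basic properties of $\mathsf P^{\mathcal E}$.

Since (thanks to the Markov property) $\mathsf P^{\mathcal E}$ is a contraction also in the $L^1(X,\mathfrak m)$ norm, we can canonically extend it to a linear semigroup in $L^1(X,\mathfrak m)$, thanks to the density of $L^2(X,\mathfrak m)$ in $L^1(X,\mathfrak m)$. This extension of the semigroup, for which we retain the notation $\mathsf P^{\mathcal E}$, obviously satisfies

\[ \int g\,\mathsf P_tf\,d\mathfrak m=\int f\,\mathsf P_tg\,d\mathfrak m\qquad\forall f\in L^1(X,\mathfrak m),\ g\in L^\infty(X,\mathfrak m),\ t\ge0. \tag{10.4} \]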


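Proposition 10.2 (Properties of $\mathsf P^{\mathcal E}$ and derivative of the entropy). $\mathsf P^{\mathcal E}$ is a Markov self-adjoint linear semigroup in $L^2(X,\mathfrak m)$, $\Delta_{\mathcal E}$ has a dense domain and $t\mapsto\int\mathsf P_tf\ln\mathsf P_tf\,d\mathfrak m$ is locally absolutely continuous in $[0,\infty)$ for all $\mu=f\mathfrak m\in D(\mathrm{Ent})$, with

\[ -\frac{d}{dt}\int\mathsf P_tf\ln\mathsf P_tf\,d\mathfrak m=4\,\mathcal E\big(\sqrt{\mathsf P_tf}\big)=\int_{\{\mathsf P_tf>0\}}\frac{\Gamma(\mathsf P_tf)}{\mathsf P_tf}\,d\mathfrak m\qquad\text{for a.e. } t>0. \tag{10.5} \]

In addition, if $L^2(X,\mathfrak m)$ is separable, $\mathbb V$ is a separable Hilbert space.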

Proof. The first properties are standard in the theory of semigroups, while (10.5) follows by the chain rule if $f\ge c>0$ $\mathfrak m$-a.e. in $X$ and by an easy approximation, since $\mathfrak m$ is finite, in the general case.

In order to prove separability, recall that, according to a standard result in the theory of semigroups (see for instance [4, Lem. 4.9]), it suffices to find a separable and $\mathsf P^{\mathcal E}$-invariant subspace $\mathbb V'\subset\mathbb V$. The subspace

\[ \mathbb V':=\bigcup_{t>0}\mathsf P_tL^2(X,\mathfrak m) \]

is $\mathsf P^{\mathcal E}$-invariant. Its separability follows from the separability of $L^2(X,\mathfrak m)$ and from the regularizing estimate $\mathcal E(\mathsf P_tf)\le\|f\|_2^2/t$ for all $t>0$ and $f\in L^2(X,\mathfrak m)$, which corresponds to (3.7) with $K=0$, $x=f$ and $z=0$. □

We recall one of the possible formulations of the functional Bakry-Émery condition [11] for energy measure spaces [5, 12]; other equivalent characterizations in this abstract framework may be found in [5, Sect. 2.2], see also [10].

Definition 10.3 (Bakry-Émery condition via gradient contractivity). We say that the energy measure space $(X,\mathcal B,\mathcal E,\mathfrak m)$ satisfies the Bakry-Émery condition $\mathrm{BE}(K,\infty)$, $K\in\mathbb R$, if

\[ \text{for every } g\in\mathbb A_{\mathcal E}\qquad \Gamma(\mathsf P_tg)\le e^{-2Kt}\,\mathsf P_t\Gamma(g)\quad\mathfrak m\text{-a.e. in } X,\ \text{for all } t\ge0. \tag{BE($K,\infty$)} \]
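As an illustration that is not taken from the text, the condition can be checked by hand for the one-dimensional Ornstein-Uhlenbeck semigroup. The sketch below assumes $X=\mathbb R$, $\mathfrak m=\gamma$ the standard Gaussian measure, $\Gamma(g)=|g'|^2$ and Mehler's representation of the semigroup; all of these choices are assumptions made only for this example.

```latex
% Assumed setting (illustrative only): X = R, m = gamma (standard Gaussian),
% E(g) = \int |g'|^2 d\gamma, so that Gamma(g) = |g'|^2.
% Mehler's formula for the Ornstein-Uhlenbeck semigroup:
\[
  \mathsf P_t g(x)=\int_{\mathbb R} g\big(e^{-t}x+\sqrt{1-e^{-2t}}\,y\big)\,d\gamma(y).
\]
% Differentiating under the integral sign (g smooth with bounded derivative):
\[
  (\mathsf P_t g)'(x)=e^{-t}\,\mathsf P_t(g')(x).
\]
% Jensen's inequality for the Markov operator P_t then gives
\[
  \Gamma(\mathsf P_t g)=e^{-2t}\big(\mathsf P_t(g')\big)^2
  \le e^{-2t}\,\mathsf P_t\big(|g'|^2\big)=e^{-2t}\,\mathsf P_t\Gamma(g),
\]
% which is the gradient contractivity estimate with K = 1.
```

In this model case the commutation $(\mathsf P_tg)'=e^{-t}\mathsf P_t(g')$ is exact, which is why the constant $K=1$ appears with no loss.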


10.2. Extended distances induced by an energy measure space. In this context the definition of $\mathrm{CE}^2(X,\mathrm{Ch},\mathfrak m)$ given in the metric setting can be immediately adapted, namely a curve $\rho_s$ of probability densities belongs to $\mathrm{CE}^2(X,\mathcal E,\mathfrak m)$ if for some $c\in L^2(0,T)$ one has

\[ \Big|\int f\rho_t\,d\mathfrak m-\int f\rho_s\,d\mathfrak m\Big|\le\int_s^tc(r)\Big(\int\Gamma(f)\rho_r\,d\mathfrak m\Big)^{1/2}dr\qquad\forall f\in\mathbb A_{\mathcal E} \tag{10.6} \]

for $0\le s\le t\le T$. The least $c$ will still be denoted by $\|\rho'_t\|$.

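Also the counterparts $W_{\mathcal E}$ and $W_{\mathcal E,*}$ of $W_{\mathrm{Ch}}$ and $W_{\mathrm{Ch},*}$ can be immediately defined:

Definition 10.4. For $\mu_0=\rho_0\mathfrak m$, $\mu_1=\rho_1\mathfrak m\in\mathscr P^a(X)$ we define

\[ W^2_{\mathcal E}(\mu_0,\mu_1):=\inf\Big\{\int_0^1\|\rho'_t\|^2\,dt:\ \rho_t\in\mathrm{CE}^2(X,\mathcal E,\mathfrak m)\Big\} \tag{10.7} \]

and

\[ W^2_{\mathcal E,*}(\mu_0,\mu_1):=2\sup_{\varphi}\int(\varphi_1\rho_1-\varphi_0\rho_0)\,d\mathfrak m, \tag{10.8} \]

where the supremum runs among all $(\mathscr L^1\otimes\mathcal B)$-measurable bounded maps $\varphi(t,x)=\varphi_t(x)$ satisfying $\varphi\in C_{w*}([0,1];L^\infty(X,\mathfrak m))\cap L^1([0,1];\mathbb V)$, and

\[ \frac{d}{dt}\varphi_t+\frac12\Gamma(\varphi_t)\le0\quad\text{in } (0,1)\times X,\ \text{in the duality with } \mathbb A_{\mathcal E}. \tag{10.9} \]

As for the metric theory, we will often use the simplified notation $W_{\mathcal E}(\rho_0,\rho_1)$, $W_{\mathcal E,*}(\rho_0,\rho_1)$ for $W_{\mathcal E}(\rho_0\mathfrak m,\rho_1\mathfrak m)$, $W_{\mathcal E,*}(\rho_0\mathfrak m,\rho_1\mathfrak m)$ respectively.

Arguing as in the proof of Proposition 7.4, it is easily seen that $W_{\mathcal E}$ is a length distance and that $W^2_{\mathcal E}$ is jointly convex. In addition, with the same proof given in the metric setting, $t\mapsto\rho_t:=\mathsf P_t\rho\in\mathrm{CE}^2(X,\mathcal E,\mathfrak m)$ in any bounded interval $[0,T]$ with

\[ \|\rho'_t\|^2\le\int_{\{\rho_t>0\}}\frac{\Gamma(\rho_t)}{\rho_t}\,d\mathfrak m\qquad\text{for a.e. } t>0. \tag{10.10} \]

Concerning $W_{\mathcal E,*}$ one can also extend the same considerations of Lemma 7.7, obtaining in particular an equivalent definition where the supremum in (10.8) runs in $C^k([0,1];\mathbb A_{\mathcal E})$; as for Remark 7.8, it is also easy to check the joint convexity and the lower semicontinuity of $W^2_{\mathcal E,*}$ with respect to the weak $L^1$-topology.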

The following result can be obtained with the same proof given in the metric setting, see 
Proposition 7.9. 

Proposition 10.5. $W_{\mathcal E}\ge W_{\mathcal E,*}$ on $\mathscr P^a(X)\times\mathscr P^a(X)$.

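Let us quickly discuss two cases when it is possible to prove that the distance $W_{\mathcal E}$ (and a fortiori $W_{\mathcal E,*}$) between two probability densities is finite.

Lemma 10.6. Let us suppose that $\mathcal E$ satisfies the global Poincaré inequality

\[ \int\Big|f-\int f\,d\mathfrak m\Big|^2d\mathfrak m\le c_P\,\mathcal E(f)\qquad\text{for every } f\in\mathbb V. \tag{10.11} \]

Then if $\rho_0,\rho_1\in L^2(X,\mathfrak m)$ are probability densities with $\rho_i\ge\varrho>0$ $\mathfrak m$-a.e. in $X$, $i=0,1$, we have

\[ W^2_{\mathcal E}(\rho_0,\rho_1)\le\frac{c_P}{\varrho}\int|\rho_1-\rho_0|^2\,d\mathfrak m. \tag{10.12} \]

Proof. We just take the linear connecting curve $\rho_s:=(1-s)\rho_0+s\rho_1$, and we observe that, with $\bar f=\int f\,d\mathfrak m$, for every $0\le s\le t\le1$ and $r\in[s,t]$ one has

\[ \int(\rho_1-\rho_0)f\,d\mathfrak m=\int(\rho_1-\rho_0)(f-\bar f)\,d\mathfrak m\le\|\rho_1-\rho_0\|_2\,\|f-\bar f\|_2\le\Big(\frac{c_P}{\varrho}\Big)^{1/2}\|\rho_1-\rho_0\|_2\Big(\int\rho_r\Gamma(f)\,d\mathfrak m\Big)^{1/2}. \]

Since $\rho_t-\rho_s=(t-s)(\rho_1-\rho_0)$, this shows that (10.6) holds with the constant function $c\equiv(c_P/\varrho)^{1/2}\|\rho_1-\rho_0\|_2$, which gives (10.12). □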

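Lemma 10.7. Let us suppose that $\mathcal E$ satisfies the Logarithmic Sobolev inequality

\[ 2\,\mathrm{Ent}(\rho\,\mathfrak m)\le c_{LS}\int_{\{\rho>0\}}\frac{\Gamma(\rho)}{\rho}\,d\mathfrak m=4c_{LS}\,\mathcal E(\sqrt\rho) \tag{10.13} \]

for every probability density $\rho$ with $\sqrt\rho\in\mathbb V$. Then the Talagrand inequality holds:

\[ \frac12W^2_{\mathcal E}(\mu,\mathfrak m)\le c_{LS}\,\mathrm{Ent}(\mu)\qquad\text{for every } \mu\in D(\mathrm{Ent}). \tag{10.14} \]

In particular, $W_{\mathcal E}(\mu_0,\mu_1)<\infty$ whenever $\mu_i=\rho_i\mathfrak m\in D(\mathrm{Ent})$.

Proof. We follow the argument of [37], using the Logarithmic Sobolev inequality to show that the heat flow $\mathsf P_t\rho$ connects $\mu$ to $\mathfrak m$ and to estimate its length.

If $\mu=\rho\,\mathfrak m\in D(\mathrm{Ent})$, the curve $\mu_t=\rho_t\mathfrak m$ with $\rho_t=\mathsf P_t\rho$ belongs to $\mathrm{CE}^2(X,\mathcal E,\mathfrak m)$ in any bounded interval $[0,T]$, since for every $f\in\mathbb A_{\mathcal E}$ and for every $0\le s<t$

\[ \int f(\rho_t-\rho_s)\,d\mathfrak m=-\int_s^t\!\!\int\Gamma(f,\rho_r)\,d\mathfrak m\,dr\le\int_s^t\Big(\int_{\{\rho_r>0\}}\frac{\Gamma(\rho_r)}{\rho_r}\,d\mathfrak m\Big)^{1/2}\Big(\int\rho_r\Gamma(f)\,d\mathfrak m\Big)^{1/2}dr; \tag{10.15} \]

the same formula shows that

\[ \|\rho'_t\|\le\Big(\int_{\{\rho_t>0\}}\frac{\Gamma(\rho_t)}{\rho_t}\,d\mathfrak m\Big)^{1/2}. \]

On the other hand, for every time $t$ with $\mathrm{Ent}(\mu_t)>0$, (10.5) yields

\[ -\frac{d}{dt}\big(\mathrm{Ent}(\mu_t)\big)^{1/2}=\int_{\{\rho_t>0\}}\frac{\Gamma(\rho_t)}{\rho_t}\,d\mathfrak m\ \big(4\,\mathrm{Ent}(\mu_t)\big)^{-1/2}\ge\Big(\frac{1}{2c_{LS}}\int_{\{\rho_t>0\}}\frac{\Gamma(\rho_t)}{\rho_t}\,d\mathfrak m\Big)^{1/2}. \tag{10.16} \]

Since the left hand side is integrable in $(0,\infty)$, also the right hand side is integrable and, in particular, the essential liminf of $\int_{\{\rho_t>0\}}\Gamma(\rho_t)/\rho_t\,d\mathfrak m$ as $t\to\infty$ is null. From (10.13) and the monotonicity of entropy we conclude that $\mathrm{Ent}(\mu_t)\to0$ as $t\to\infty$ and thus $\rho_t\to1$ in $L^1(X,\mathfrak m)$ as $t\to\infty$. Thus we can pass to the limit $t\to\infty$ in (10.15) and obtain that $t\mapsto\mu_t$ connects $\mu$ to $\mathfrak m$ and is admissible (after reparametrization on a finite interval) in the definition of $W_{\mathcal E}$. Moreover, (10.16) provides the velocity estimate

\[ \|\rho'_t\|\le-\sqrt{2c_{LS}}\ \frac{d}{dt}\big(\mathrm{Ent}(\mu_t)\big)^{1/2}. \]

Eventually, integrating this last inequality from $0$ to $\infty$, recalling that $\mathrm{Ent}(\mu)\ge0$ and that $\mathrm{Ent}(\mu_t)\to0$ as $t\to\infty$, the Talagrand inequality (10.14) follows. □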
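The final integration can be spelled out as follows. This is only a sketch: it uses (10.5), (10.13) and the velocity bound that comes from (10.15), and the shorthand $I(\rho_t)$ for the Fisher information integral is introduced here for convenience.

```latex
% Shorthand introduced for this sketch:
%   I(\rho_t) := \int_{\{\rho_t>0\}} \Gamma(\rho_t)/\rho_t \, d\mathfrak m .
% Inputs: (10.5)  d/dt Ent(\mu_t) = - I(\rho_t),
%         (10.13) 2 Ent(\mu_t) \le c_{LS} I(\rho_t),
%         (10.15) \|\rho_t'\| \le I(\rho_t)^{1/2}.
\[
  -\frac{d}{dt}\sqrt{\mathrm{Ent}(\mu_t)}
  =\frac{I(\rho_t)}{2\sqrt{\mathrm{Ent}(\mu_t)}}
  \ge\frac{I(\rho_t)}{\sqrt{2c_{LS}\,I(\rho_t)}}
  =\sqrt{\frac{I(\rho_t)}{2c_{LS}}}
  \ge\frac{\|\rho_t'\|}{\sqrt{2c_{LS}}}.
\]
% Integrating on (0,\infty), using Ent(\mu_t) -> 0 and the fact that the
% distance is bounded by the length of any admissible curve:
\[
  W_{\mathcal E}(\mu,\mathfrak m)\le\int_0^\infty\|\rho_t'\|\,dt
  \le\sqrt{2c_{LS}\,\mathrm{Ent}(\mu)},
  \qquad\text{i.e.}\qquad
  \tfrac12 W_{\mathcal E}^2(\mu,\mathfrak m)\le c_{LS}\,\mathrm{Ent}(\mu).
\]
```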


When $\mathrm{BE}(K,\infty)$ holds with $K>0$, the well known argument of Bakry and Émery (we will also provide a proof based on the EVI formulation, which in turn follows from $\mathrm{BE}(K,\infty)$, see Corollary 11.5) yields the validity of the Logarithmic Sobolev inequality (10.13) with $c_{LS}=K^{-1}$, provided $(X,\mathcal B,\mathcal E,\mathfrak m)$ satisfies the irreducibility condition

\[ f\in\mathbb V,\ \mathcal E(f)=0\ \Longrightarrow\ f=c\ \ \mathfrak m\text{-a.e. in } X \text{ for some } c\in\mathbb R, \tag{10.17} \]

which is also equivalent to the $L^2$-ergodicity of the semigroup $\mathsf P^{\mathcal E}$:

\[ \lim_{t\to\infty}\mathsf P_tf=\int f\,d\mathfrak m\quad\text{strongly in } L^2(X,\mathfrak m),\ \text{for every } f\in L^2(X,\mathfrak m). \tag{10.18} \]

Corollary 10.8. If $(X,\mathcal B,\mathcal E,\mathfrak m)$ is irreducible according to (10.17) and $\mathrm{BE}(K,\infty)$ holds with $K>0$ then (10.13) and (10.14) are satisfied with $c_{LS}=K^{-1}$. In particular every couple of probability measures $\mu_i=\rho_i\mathfrak m\in D(\mathrm{Ent})$ has finite distance $W_{\mathcal E}(\mu_0,\mu_1)<\infty$.

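Our goal is now to prove that under the contractivity assumption $\mathrm{BE}(K,\infty)$ the upper length distance associated to $W_{\mathcal E,*}$ coincides with $W_{\mathcal E}$. In the proof the following lemma will play a crucial role.

Lemma 10.9. If $\mathrm{BE}(K,\infty)$ holds, then for any curve $(\mu_t)_{t\in[0,1]}$ in $AC^2([0,1];(\mathscr P(X),W_{\mathcal E,*}))$ with $\mu_t=\rho_t\mathfrak m$ and any $\varphi\in\mathbb A_{\mathcal E}$ one has (denoting by $|\dot\mu_t|$ the metric derivative w.r.t. $W_{\mathcal E,*}$)

\[ \Big|\frac{d}{dt}\int\rho_t\,\varphi\,d\mathfrak m\Big|\le|\dot\mu_t|\Big(\int\Gamma(\varphi)\rho_t\,d\mathfrak m\Big)^{1/2}\qquad\text{for a.e. } t\in(0,1). \tag{10.19} \]

Proof. First, defining $W_{\mathcal E,*,1}\le W_{\mathcal E,*}$ as in (7.15) of Remark 7.10, we obtain that $s\mapsto\int\varphi\rho_s\,d\mathfrak m$ is absolutely continuous for all $\varphi\in\mathbb A_{\mathcal E}$. Let $\psi\in\mathbb A_{\mathcal E}\cap D(\Delta_{\mathcal E})$ with $\inf\psi>0$. Using the identity $-\psi^{-1}\Delta_{\mathcal E}\psi+\Delta_{\mathcal E}\log\psi=-\Gamma(\log\psi)$ and the gradient contractivity condition it is easy to check that, for $K\ge0$,

\[ \psi_r:=2\,\mathsf P_r\log\mathsf P_{\delta-r}\psi,\qquad r\in[0,\delta], \]

is admissible in $W_{\mathcal E,*}$. We fix a point $t$ where $s\mapsto\mu_s$ is metrically differentiable and $s\mapsto\int\rho_s\log\psi\,d\mathfrak m$ is differentiable. Fix $s>t$ and $\delta=\lambda(s-t)$ with $\lambda>0$. From the inequality

\[ \int(\mathsf P_\delta\log\psi)\,\rho_t\,d\mathfrak m-\int(\log\mathsf P_\delta\psi)\,\rho_s\,d\mathfrak m\le\frac{1}{4\delta}W^2_{\mathcal E,*}(\rho_s,\rho_t), \]

dividing both sides by $(s-t)$ and letting $s\downarrow t$ gives

\[ \lambda\int\big(\Delta_{\mathcal E}\log\psi-\psi^{-1}\Delta_{\mathcal E}\psi\big)\rho_t\,d\mathfrak m-\frac{d}{dt}\int\rho_t\log\psi\,d\mathfrak m\le\frac{1}{4\lambda}|\dot\mu_t|^2. \]

It follows that

\[ \Big|\frac{d}{dt}\int\rho_t\log\psi\,d\mathfrak m\Big|\le\lambda\int\Gamma(\log\psi)\rho_t\,d\mathfrak m+\frac{1}{4\lambda}|\dot\mu_t|^2. \]

By a simple approximation, the same inequality holds for a.e. $t\in(0,1)$ for all $\psi\in\mathbb A_{\mathcal E}$ with $\inf\psi>0$, i.e. removing the assumption $\psi\in D(\Delta_{\mathcal E})$. By minimizing w.r.t. $\lambda$ and setting $\psi=e^{\varphi}$ we get the result.

In the case $K<0$ we need to consider the reparameterization $2\,\mathsf P_{\theta(r)}\log\mathsf P_{\theta(\delta-r)}\psi$, $r\in[0,\delta]$, where $\theta(0)=0$ and $\theta'(r)=e^{-2Kr}$. Since $\theta(r)=r+o(r)$ as $r\downarrow0$, the same expansions above work with this modified function. □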
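For completeness, the optimization in $\lambda$ that closes the proof of Lemma 10.9 can be written out. The sketch assumes that the inequality to be optimized has the form $|a|\le\lambda A+B/(4\lambda)$ for every $\lambda>0$; the shorthand $a$, $A$, $B$ is introduced here and is not part of the original notation.

```latex
% Shorthand (introduced for this sketch):
%   a := d/dt \int \rho_t \log\psi \, d\mathfrak m,
%   A := \int \Gamma(\log\psi)\,\rho_t \, d\mathfrak m,
%   B := |\dot\mu_t|^2 .
\[
  |a|\le\inf_{\lambda>0}\Big(\lambda A+\frac{B}{4\lambda}\Big)=\sqrt{AB},
  \qquad\text{with the infimum attained at }\lambda=\tfrac12\sqrt{B/A}\ \text{ when } A>0 .
\]
% Setting \psi = e^{\varphi}, so that \log\psi = \varphi:
\[
  \Big|\frac{d}{dt}\int\rho_t\,\varphi\,d\mathfrak m\Big|
  \le|\dot\mu_t|\Big(\int\Gamma(\varphi)\,\rho_t\,d\mathfrak m\Big)^{1/2}.
\]
```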


Remark 10.10. Notice that Lemma 10.9 and the next proposition could be reproduced even in the metric setting, since the proof of Lemma 10.9 used only the diffusion formula $\Delta_{\mathcal E}\phi(f)=\phi'(f)\Delta_{\mathcal E}f+\phi''(f)\Gamma(f)$, known to be true also in the metric setting (see [26, Prop. 4.11]), where $\Delta$ might be nonlinear. On the other hand, we preferred to state these results in this section because gradient contractivity conditions are expected to hold only in the presence of quadratic energies. In the subclass of Minkowski spaces, it is known (see [36]) that contractivity of the heat flow w.r.t. $W_{\mathsf d}$ holds if and only if the Minkowski structure is induced by an inner product.


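Proposition 10.11 ($W_{\mathcal E}$ is the upper length distance of $W_{\mathcal E,*}$). If $\mathrm{BE}(K,\infty)$ holds, then $W_{\mathcal E}$ is the upper length extended distance associated to $W_{\mathcal E,*}$ according to (3.9).

Proof. Since $W_{\mathcal E}$ is a length distance and is larger than $W_{\mathcal E,*}$, one inequality is obvious. Let us apply Lemma 10.9 to obtain

\[ \Big|\int f\rho_t\,d\mathfrak m-\int f\rho_s\,d\mathfrak m\Big|\le\int_s^t|\dot\mu_r|\Big(\int\Gamma(f)\rho_r\,d\mathfrak m\Big)^{1/2}dr \]

for all $\mu_t=\rho_t\mathfrak m\in AC^2([0,1];(\mathscr P(X),W_{\mathcal E,*}))$ and all $f\in\mathbb A_{\mathcal E}$. It follows that $\mu_t$ is absolutely continuous w.r.t. $W_{\mathcal E}$ and that

\[ W^2_{\mathcal E}(\mu_1,\mu_0)\le\int_0^1|\dot\mu_t|^2\,dt, \]

which provides, by the arbitrariness of $\mu_t$, the converse inequality. □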


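We conclude this section by proving that $\mathrm{BE}(K,\infty)$ allows for a further regularization in the definition of $W_{\mathcal E,*}$. Let us first introduce the Banach space

\[ D_\infty(\Delta_{\mathcal E}):=\{f\in L^\infty(X,\mathfrak m)\cap D(\Delta_{\mathcal E}):\ \Delta_{\mathcal E}f\in L^\infty(X,\mathfrak m)\} \tag{10.20} \]

endowed with the graph norm $\|f\|_{D_\infty}:=\|f\|_\infty+\|\Delta_{\mathcal E}f\|_\infty$, and let us recall (see [7]) that if $\mathrm{BE}(K,\infty)$ holds then $D_\infty(\Delta_{\mathcal E})$ is an algebra, continuously imbedded in $\mathbb A_{\mathcal E}$; in particular, there exists a constant $C_K>0$ satisfying

\[ \|\Gamma(f)\|_\infty\le C_K\|f\|_\infty\|f\|_{D_\infty}\le C_K\|f\|^2_{D_\infty}\qquad\text{for every } f\in D_\infty(\Delta_{\mathcal E}). \tag{10.21} \]

Lemma 10.12. The extended distance $W_{\mathcal E,*}$ can be expressed by the duality formula (10.8) where the supremum runs among all $\varphi\in C^\infty([0,1];D_\infty(\Delta_{\mathcal E}))$.

Proof. Arguing as in Lemma 7.7 it is not restrictive to take the supremum of (10.8) assuming $\varphi\in C^\infty([0,1];\mathbb A_{\mathcal E})$. We then set $\varphi^\varepsilon_t:=\mathsf h^\varepsilon\varphi_t$, where $\mathsf h^\varepsilon$ is the mollification of the semigroup introduced by (10.28). Since $\mathsf h^\varepsilon$ is a bounded linear operator from $\mathbb A_{\mathcal E}$ to $D_\infty(\Delta_{\mathcal E})$, the curve $t\mapsto\varphi^\varepsilon_t$ still belongs to $C^\infty([0,1];D_\infty(\Delta_{\mathcal E}))$. On the other hand, the commutation property (10.29) shows that $\varphi^\varepsilon$ is still a subsolution to (10.9). Since $\varphi^\varepsilon_t\to\varphi_t$ weakly* in $L^\infty(X,\mathfrak m)$ as $\varepsilon\downarrow0$, it is immediate to check that

\[ \lim_{\varepsilon\downarrow0}2\int(\rho_1\varphi^\varepsilon_1-\rho_0\varphi^\varepsilon_0)\,d\mathfrak m=2\int(\rho_1\varphi_1-\rho_0\varphi_0)\,d\mathfrak m. \]

□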


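10.3. Bakry-Émery condition and contractivity of the Heat semigroup. We say that $\mathsf P^{\mathcal E}$ is $K$-contractive w.r.t. $W_{\mathcal E}$ if

\[ W_{\mathcal E}(\mathsf P_tf,\mathsf P_tg)\le e^{-Kt}\,W_{\mathcal E}(f,g)\qquad\forall t\ge0 \tag{10.22} \]

for all $f,g\in L^1_+(X,\mathfrak m)$ with $\int f\,d\mathfrak m=\int g\,d\mathfrak m=1$.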

The proof of the Bakry-Émery gradient estimate below will use some results of the theory developed recently in [9, Thm. 4.6] for the continuity equation in metric measure spaces. Its proof uses Hilbert space techniques and a vanishing viscosity argument, choosing as algebra of functions the set $\mathbb V\cap L^\infty(X,\mathfrak m)$.

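Theorem 10.13. Assume that $L^2(X,\mathfrak m)$ is separable. Let $V\in D(\Delta_{\mathcal E})\cap L^\infty(X,\mathfrak m)$ with $\Delta_{\mathcal E}V\in L^\infty(X,\mathfrak m)$. Then for any $\bar\rho\in L^\infty(X,\mathfrak m)$ and any $T\in(0,\infty)$ there exists $\rho\in C_{w*}([0,T];L^\infty(X,\mathfrak m))$ with $\rho_0=\bar\rho$ and

\[ \int f\rho_{s_2}\,d\mathfrak m-\int f\rho_{s_1}\,d\mathfrak m=\int_{s_1}^{s_2}\!\!\int\Gamma(f,V)\rho_r\,d\mathfrak m\,dr\qquad\forall\,0\le s_1\le s_2\le T \tag{10.23} \]

for all $f\in\mathbb V\cap L^\infty(X,\mathfrak m)$. In particular $\rho\in\mathrm{CE}^2(X,\mathcal E,\mathfrak m)$ and the metric derivative of $s\mapsto\mu_s=\rho_s\mathfrak m$ w.r.t. $W_{\mathcal E}$ can be estimated by

\[ |\dot\mu_s|^2\le\int\Gamma(V)\rho_s\,d\mathfrak m\qquad\text{for a.e. } s\in(0,T). \tag{10.24} \]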

Now we prove that the Bakry-Émery gradient estimate $\mathrm{BE}(K,\infty)$ is equivalent to $K$-contractivity of $\mathsf P^{\mathcal E}$ w.r.t. $W_{\mathcal E}$.

Theorem 10.14. If $\mathsf P^{\mathcal E}$ is $K$-contractive w.r.t. $W_{\mathcal E}$ and $L^2(X,\mathfrak m)$ is separable, then the Bakry-Émery condition $\mathrm{BE}(K,\infty)$ holds. Conversely, if $\mathrm{BE}(K,\infty)$ holds, then $\mathsf P^{\mathcal E}$ is $K$-contractive w.r.t. $W_{\mathcal E}$.

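Proof. Fix $t>0$. We first consider $g\in\mathbb V\cap L^\infty(X,\mathfrak m)$ with $\Delta_{\mathcal E}g\in L^\infty(X,\mathfrak m)$. Fix a measure $\mu_0=\rho_0\mathfrak m\in\mathscr P(X)$ with $\rho_0\in L^\infty(X,\mathfrak m)$. Let us denote by $(\rho_s)_{s\in[0,1]}$ the solution to the continuity equation driven by the gradient of the constant (w.r.t. the time parameter $s$) potential $V:=\mathsf P_tg$ starting from $\rho_0$, given by Theorem 10.13. Now, let us first note that for any $h>0$ one can apply (10.23) with $f=\mathsf P_tg$ to get

\[ \int\rho_h\,\mathsf P_tg\,d\mathfrak m-\int\rho_0\,\mathsf P_tg\,d\mathfrak m=\int_0^h\!\!\int\Gamma(\mathsf P_tg)\rho_s\,d\mathfrak m\,ds. \tag{10.25} \]

On the other hand, putting $\mu^t_s=(\mathsf P_t\rho_s)\mathfrak m$ and denoting by $|\dot\mu^t_s|$ the metric derivative of the curve $s\mapsto\mu^t_s$, we can estimate:

\[ \int\rho_h\,\mathsf P_tg\,d\mathfrak m-\int\rho_0\,\mathsf P_tg\,d\mathfrak m=\int g\,\mathsf P_t\rho_h\,d\mathfrak m-\int g\,\mathsf P_t\rho_0\,d\mathfrak m\le\int_0^h|\dot\mu^t_s|\Big(\int\Gamma(g)\,\mathsf P_t\rho_s\,d\mathfrak m\Big)^{1/2}ds \]
\[ \le e^{-Kt}\Big(\int_0^h|\dot\mu_s|^2\,ds\Big)^{1/2}\Big(\int_0^h\!\!\int\Gamma(g)\,\mathsf P_t\rho_s\,d\mathfrak m\,ds\Big)^{1/2} \]
\[ \le e^{-Kt}\Big(\int_0^h\!\!\int\Gamma(\mathsf P_tg)\rho_s\,d\mathfrak m\,ds\Big)^{1/2}\Big(\int_0^h\!\!\int\mathsf P_t\Gamma(g)\rho_s\,d\mathfrak m\,ds\Big)^{1/2}, \]

where we have used first the inequality $|\dot\mu^t_s|\le e^{-Kt}|\dot\mu_s|$ (derived from the Wasserstein contraction (10.22)) and then (10.24). Combining with (10.25) we obtain

\[ \int_0^h\!\!\int\Gamma(\mathsf P_tg)\rho_s\,d\mathfrak m\,ds\le e^{-2Kt}\int_0^h\!\!\int\mathsf P_t\Gamma(g)\rho_s\,d\mathfrak m\,ds. \tag{10.26} \]

Dividing by $h$, letting $h\downarrow0$ and using the weak* continuity of the curve $\rho_s$ we finally get:

\[ \int\Gamma(\mathsf P_tg)\rho_0\,d\mathfrak m\le e^{-2Kt}\int\mathsf P_t\Gamma(g)\rho_0\,d\mathfrak m. \tag{10.27} \]

By homogeneity, the same inequality holds for any $\rho_0\in L^\infty_+(X,\mathfrak m)$. This clearly implies $\Gamma(\mathsf P_tg)\le e^{-2Kt}\,\mathsf P_t\Gamma(g)$ $\mathfrak m$-a.e. in $X$.

To prove the assertion for arbitrary $g\in\mathbb V\cap L^\infty(X,\mathfrak m)$ we argue by approximation. Consider the following mollification of the semigroup, defined for $\varepsilon>0$ and $f\in L^2(X,\mathfrak m)$ via:

\[ \mathsf h^\varepsilon f:=\frac1\varepsilon\int_0^\infty\kappa\Big(\frac t\varepsilon\Big)\,e^{(K\wedge0)t}\,\mathsf P_tf\,dt, \tag{10.28} \]

with a non-negative kernel $\kappa\in C^\infty_c(0,\infty)$ satisfying $\int_0^\infty\kappa(t)\,dt=1$. It is easily seen that $\mathsf h^\varepsilon$ is a linear contraction in $\mathbb A_{\mathcal E}$ satisfying

\[ \|\mathsf h^\varepsilon f\|_\infty\le\|f\|_\infty,\qquad\Gamma(\mathsf h^\varepsilon f)\le\mathsf h^\varepsilon\Gamma(f),\qquad\text{for every } f\in\mathbb A_{\mathcal E} \tag{10.29} \]

and

\[ f\in L^\infty(X,\mathfrak m)\ \Longrightarrow\ \Delta_{\mathcal E}\mathsf h^\varepsilon f\in L^\infty(X,\mathfrak m),\quad\|\Delta_{\mathcal E}\mathsf h^\varepsilon f\|_\infty\le C_\varepsilon\|f\|_\infty \tag{10.30} \]

for some constant $C_\varepsilon>0$. From the previous argument, applied to $g^\varepsilon:=\mathsf h^\varepsilon g$, we thus obtain

\[ \int\Gamma(\mathsf P_tg^\varepsilon)\rho_0\,d\mathfrak m\le e^{-2Kt}\int\mathsf P_t\Gamma(g^\varepsilon)\rho_0\,d\mathfrak m \]

for any $\rho_0\in L^\infty_+(X,\mathfrak m)$. To conclude, it is sufficient to check that $\mathsf P_tg^\varepsilon\to\mathsf P_tg$ and $g^\varepsilon\to g$ in $\mathbb V$ as $\varepsilon\downarrow0$. By convexity of $\mathcal E$ this in turn follows from the fact that for any $f\in\mathbb V$ we have $\mathcal E(\mathsf P_sf-f)\to0$ as $s\downarrow0$.

Eventually we can prove the statement for any $g\in\mathbb V$ with a truncation argument.

In order to prove the converse statement, notice first that $\mathrm{BE}(K,\infty)$ implies that $\mathbb A_{\mathcal E}$ is invariant under the action of the semigroup. Then, recalling the definition (10.6) of $\mathrm{CE}^2(X,\mathcal E,\mathfrak m)$, (10.4) shows that $\mathsf P_t$ maps curves $\rho_s\in\mathrm{CE}^2(X,\mathcal E,\mathfrak m)$ to curves $\sigma_s:=\mathsf P_t\rho_s\in\mathrm{CE}^2(X,\mathcal E,\mathfrak m)$ with $\|\sigma'_s\|\le e^{-Kt}\|\rho'_s\|$. By minimization we obtain the contractivity property. □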


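We prove now, by standard methods, that $\mathrm{BE}(K,\infty)$ implies $K$-contractivity of $\mathsf P^{\mathcal E}$ also w.r.t. $W_{\mathcal E,*}$, a property that will also follow as a consequence of the EVI estimates of the next section. We do not know whether the converse implication, known to be true for $W_{\mathcal E}$ under the separability assumption on $L^2(X,\mathfrak m)$, holds.

Proposition 10.15. If $\mathrm{BE}(K,\infty)$ holds, then $W_{\mathcal E,*}(\mathsf P_t\rho_0,\mathsf P_t\rho_1)\le e^{-Kt}\,W_{\mathcal E,*}(\rho_0,\rho_1)$ for all $t\ge0$ and $\rho_0\mathfrak m,\rho_1\mathfrak m\in\mathscr P^a(X)$.

Proof. Take $\varphi$ admissible in the definition of $W_{\mathcal E,*}$ and note that $\psi=e^{2Kt}\,\mathsf P_t\varphi$ is again admissible. Indeed, $\mathrm{BE}(K,\infty)$ gives

\[ \frac{d}{ds}\psi_s+\frac12\Gamma(\psi_s)\le e^{2Kt}\,\mathsf P_t\Big(\frac{d}{ds}\varphi_s+\frac12\Gamma(\varphi_s)\Big)\le0. \]

Thus the definition of $W_{\mathcal E,*}$ gives:

\[ \frac12W^2_{\mathcal E,*}(\rho_0,\rho_1)\ge\int\psi_1\rho_1\,d\mathfrak m-\int\psi_0\rho_0\,d\mathfrak m=e^{2Kt}\int\big(\varphi_1\mathsf P_t\rho_1-\varphi_0\mathsf P_t\rho_0\big)\,d\mathfrak m \]

and the statement follows by taking the supremum w.r.t. $\varphi$. □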


11. From gradient contractivity to EVI and consequences 

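In this section $(X,\mathcal B,\mathcal E,\mathfrak m)$ is an energy measure space satisfying $\mathrm{BE}(K,\infty)$ for some $K\in\mathbb R$.

The main result of this section is:

Theorem 11.1 ($\mathsf P^{\mathcal E}$ satisfies EVI$_K$ relative to $W_{\mathcal E,*}$). For all $\mu=\rho\,\mathfrak m\in\mathscr P^a(X)$, $\sigma\in D(\mathrm{Ent})$ with $W_{\mathcal E,*}(\mu,\sigma)<\infty$ one has $\mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)<\infty$, $W_{\mathcal E,*}(\mathsf P_t\rho\,\mathfrak m,\sigma)<\infty$ for all $t>0$ and (recall that $\frac{d^+}{dt}$ stands for the upper right derivative)

\[ \frac{d^+}{dt}\frac12W^2_{\mathcal E,*}(\mathsf P_t\rho\,\mathfrak m,\sigma)+\frac K2W^2_{\mathcal E,*}(\mathsf P_t\rho\,\mathfrak m,\sigma)\le\mathrm{Ent}(\sigma)-\mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)\qquad\forall t\ge0. \tag{11.1} \]

Before entering into the technical details, let us briefly explain the main idea of the proof. First of all, thanks to the semigroup property of $\mathsf P_t$, it is sufficient to prove an "integrated" version of (11.1), namely

\[ \frac12W^2_{\mathcal E,*}(\mathsf P_t\rho\,\mathfrak m,\sigma)+t\,\mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)\le\frac{t}{2I_{2K}(t)}W^2_{\mathcal E,*}(\rho\,\mathfrak m,\sigma)+t\,\mathrm{Ent}(\sigma)\qquad\forall t\ge0. \tag{11.2} \]

Indeed, the expansion $t/I_{2K}(t)=1-Kt+o(t)$ and the lower semicontinuity of $\mathrm{Ent}$ provide (11.1) at $t=0$, and the semigroup property provides the result for all positive times. Notice also that (11.2) implies all finiteness properties in the statement of the theorem.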
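The expansion invoked here, and the way it turns the integrated inequality into the differential one, can be checked directly. The sketch below assumes the notation $I_K(t):=\int_0^te^{Ks}\,ds$ (presumably fixed earlier in the paper; it is an assumption of this example), $K\neq0$, and that (11.2) reads $\tfrac12W^2_{\mathcal E,*}(\mathsf P_t\rho\,\mathfrak m,\sigma)+t\,\mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)\le\tfrac{t}{2I_{2K}(t)}W^2_{\mathcal E,*}(\rho\,\mathfrak m,\sigma)+t\,\mathrm{Ent}(\sigma)$.

```latex
% Assumed notation: I_K(t) := \int_0^t e^{Ks} ds.
\[
  I_{2K}(t)=\int_0^t e^{2Ks}\,ds=\frac{e^{2Kt}-1}{2K}=t+Kt^2+o(t^2),
  \qquad
  \frac{t}{I_{2K}(t)}=\frac{1}{1+Kt+o(t)}=1-Kt+o(t)\quad(t\downarrow0).
\]
% Subtract \tfrac12 W^2(\rho m,\sigma) from both sides of (11.2) and divide by t:
\[
  \frac{1}{t}\Big(\tfrac12 W^2_{\mathcal E,*}(\mathsf P_t\rho\,\mathfrak m,\sigma)
  -\tfrac12 W^2_{\mathcal E,*}(\rho\,\mathfrak m,\sigma)\Big)
  \le\frac{1}{2t}\Big(\frac{t}{I_{2K}(t)}-1\Big)W^2_{\mathcal E,*}(\rho\,\mathfrak m,\sigma)
  +\mathrm{Ent}(\sigma)-\mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m).
\]
% As t -> 0 the first term on the right tends to -(K/2) W^2(\rho m,\sigma), and
% \liminf_{t\downarrow0} Ent(P_t\rho m) \ge Ent(\rho m) by lower semicontinuity,
% which gives (11.1) at t = 0.
```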

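We express the left-hand side by using a dual representation formula, obtained by combining (10.8) with the classical conjugate representation of the entropy functional

\[ \mathrm{Ent}(\rho\,\mathfrak m)=\sup_{\zeta\in L^\infty(X,\mathfrak m)}\Big(\int\rho\,\zeta\,d\mathfrak m-\int e^{\zeta-1}\,d\mathfrak m\Big)=\sup_{\zeta\in L^\infty(X,\mathfrak m)\cap\mathbb V}\Big(\int\rho\,\zeta\,d\mathfrak m-\int e^{\zeta-1}\,d\mathfrak m\Big), \]

where we have restricted the supremum to functions in $\mathbb V$ by a standard regularization argument (e.g. by applying (10.28)). The simple transformation $\zeta=1+\psi/t$ then yields

\[ t\,\mathrm{Ent}(\rho\,\mathfrak m)-t=\sup_{\psi\in L^\infty(X,\mathfrak m)\cap\mathbb V}\Big(\int\rho\,\psi\,d\mathfrak m-t\int e^{\psi/t}\,d\mathfrak m\Big). \tag{11.3} \]

Replacing $\rho$ with $\mathsf P_t\rho$, adding the squared distance term, and using the symmetry of $\mathsf P_t$, we end up with

\[ \frac12W^2_{\mathcal E,*}(\mathsf P_t\rho\,\mathfrak m,\sigma)+t\,\mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)-t=\sup_{(\varphi_s),\psi}\Big(\int\rho\,\mathsf P_t(\varphi_1+\psi)\,d\mathfrak m-\int\varphi_0\,d\sigma-t\int e^{\psi/t}\,d\mathfrak m\Big), \tag{11.4} \]

where $\varphi$ runs among subsolutions of (10.9) and $\psi$ runs in $L^\infty(X,\mathfrak m)\cap\mathbb V$.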

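Let us now suppose that for every choice of $(\varphi_s)_{s\in[0,1]}$ and $\psi\in L^\infty(X,\mathfrak m)\cap\mathbb V$ we can find a curve $(\psi_s)_{s\in[0,1]}$ in $\mathbb A_{\mathcal E}$ such that

\[ \psi_1=\psi\qquad\text{and}\qquad\frac{d}{ds}\mathsf P_{ts}(\varphi_s+\psi_s)+\frac{e^{2Kts}}{2}\,\Gamma\big(\mathsf P_{ts}(\varphi_s+\psi_s)\big)\le0. \tag{11.5} \]

Recalling (7.13) and (7.14) with $a=0$, $b=1$, $\vartheta(s)=e^{2Kts}$, such a curve provides the following upper bound for the term inside the "sup" in (11.4):

\[ \int\rho\,\mathsf P_t(\varphi_1+\psi)\,d\mathfrak m-\int\varphi_0\,d\sigma-t\int e^{\psi/t}\,d\mathfrak m \]
\[ =\int\rho\,\mathsf P_t(\varphi_1+\psi_1)\,d\mathfrak m-\int(\varphi_0+\psi_0)\,d\sigma+\int\psi_0\,d\sigma-t\int e^{\psi_0/t}\,d\mathfrak m+t\int\big(e^{\psi_0/t}-e^{\psi/t}\big)\,d\mathfrak m \]
\[ \le\frac{t}{2I_{2K}(t)}W^2_{\mathcal E,*}(\rho\,\mathfrak m,\sigma)+t\,\mathrm{Ent}(\sigma)-t+t\int\big(e^{\psi_0/t}-e^{\psi/t}\big)\,d\mathfrak m. \tag{11.6} \]

Expanding (11.5) and recalling that $e^{2Kts}\,\Gamma(\mathsf P_{ts}(\varphi_s+\psi_s))\le\mathsf P_{ts}\Gamma(\varphi_s+\psi_s)$ by the Bakry-Émery condition, we see that (11.5) is surely satisfied if

\[ \mathsf P_{ts}\Big(\frac{d}{ds}\psi_s+t\Delta_{\mathcal E}\psi_s+\frac12\Gamma(\psi_s)+t\Delta_{\mathcal E}\varphi_s+\Gamma(\varphi_s,\psi_s)\Big)\le0, \tag{11.7} \]

where we used the fact that

\[ \mathsf P_{ts}\Big(\frac{d}{ds}\varphi_s+\frac12\Gamma(\varphi_s)\Big)\le0 \]

since $\varphi$ is a subsolution of (10.9) and $\mathsf P_{ts}$ is positivity preserving. This property and the nonnegativity of $\Gamma(\psi_s)$ show that a candidate for (11.7) is provided by the backward Cauchy problem

\[ \frac{d}{ds}\psi_s+t\Delta_{\mathcal E}\psi_s+\Gamma(\psi_s)+t\Delta_{\mathcal E}\varphi_s+\Gamma(\varphi_s,\psi_s)=0,\quad s\in[0,1],\qquad\psi_1=\psi, \tag{11.8} \]

which can be reduced to the linear backward parabolic problem

\[ \frac{d}{ds}\zeta_s+t\Delta_{\mathcal E}\zeta_s+\zeta_s\Delta_{\mathcal E}\varphi_s+\Gamma(\varphi_s,\zeta_s)=0,\quad s\in[0,1],\qquad\zeta_1=e^{\psi/t}, \tag{11.9} \]

by applying the well known Hopf-Cole transformation

\[ \zeta_s:=e^{\psi_s/t}. \tag{11.10} \]

In conclusion, we have found that solving (11.9) and setting $\psi_s:=t\log\zeta_s$ we get the bound (11.6). Miraculously enough, since

\[ \int\zeta_s\Delta_{\mathcal E}\varphi_s\,d\mathfrak m=-\int\Gamma(\varphi_s,\zeta_s)\,d\mathfrak m,\qquad t\int\Delta_{\mathcal E}\zeta_s\,d\mathfrak m=0, \]

equation (11.9) is mass preserving, so that

\[ \int e^{\psi_0/t}\,d\mathfrak m=\int\zeta_0\,d\mathfrak m=\int\zeta_1\,d\mathfrak m=\int e^{\psi/t}\,d\mathfrak m \]

and with this particular choice the last integral term of (11.6) vanishes; since $\varphi$ and $\psi$ are arbitrary, we obtain (11.2).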
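The reduction from (11.8) to (11.9) is a direct computation with the chain rule for $\Delta_{\mathcal E}$ and $\Gamma$. The following sketch assumes that $\zeta_s$ is bounded away from $0$ and $\infty$ (this is what Step 2 below is meant to guarantee) and writes $\varphi$ for the subsolution and $\psi_s=t\log\zeta_s$ for the unknown.

```latex
% Chain rule with \psi_s = t\log\zeta_s (\zeta_s bounded away from 0 and infinity):
\[
  \frac{d}{ds}\psi_s=\frac{t}{\zeta_s}\frac{d}{ds}\zeta_s,\qquad
  \Gamma(\psi_s)=\frac{t^2}{\zeta_s^2}\,\Gamma(\zeta_s),\qquad
  \Delta_{\mathcal E}\psi_s=\frac{t}{\zeta_s}\Delta_{\mathcal E}\zeta_s-\frac{t}{\zeta_s^2}\,\Gamma(\zeta_s),\qquad
  \Gamma(\varphi_s,\psi_s)=\frac{t}{\zeta_s}\,\Gamma(\varphi_s,\zeta_s).
\]
% The quadratic terms cancel:  t\Delta\psi_s + \Gamma(\psi_s) = (t^2/\zeta_s)\Delta\zeta_s, hence
\[
  \frac{d}{ds}\psi_s+t\Delta_{\mathcal E}\psi_s+\Gamma(\psi_s)
  +t\Delta_{\mathcal E}\varphi_s+\Gamma(\varphi_s,\psi_s)
  =\frac{t}{\zeta_s}\Big(\frac{d}{ds}\zeta_s+t\Delta_{\mathcal E}\zeta_s
  +\zeta_s\Delta_{\mathcal E}\varphi_s+\Gamma(\varphi_s,\zeta_s)\Big).
\]
% So the nonlinear equation for \psi holds exactly when the linear equation for \zeta does.
```

The cancellation only works with the coefficient $1$ in front of $\Gamma(\psi_s)$, which is why the candidate problem uses $\Gamma(\psi_s)$ rather than $\tfrac12\Gamma(\psi_s)$; the leftover $-\tfrac12\Gamma(\psi_s)\le0$ is harmless for the inequality.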


Let us now check the technical details of the above argument. We divide the proof in a few steps: first of all, we will prove the existence of a sufficiently smooth solution to (11.9). We will then show that it takes values in a compact interval of $(0,\infty)$, so that it will not be difficult to check that the logarithmic transformation $\psi_s=t\log\zeta_s$ provides an admissible solution to (11.7).

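Step 1: for every $t>0$, $\zeta_1\in\mathbb V$ and $\varphi\in C^1([0,1];D_\infty(\Delta_{\mathcal E}))$, there exists a solution $\zeta\in W^{1,2}(0,1;L^2(X,\mathfrak m))\cap L^2(0,1;D(\Delta_{\mathcal E}))$ (and thus in $C^0([0,1];\mathbb V)$) of (11.9).

Reversing the time order by setting $\tilde\zeta_s:=\zeta_{1-s}$, $\tilde\varphi_s:=\varphi_{1-s}$ and recalling the "integration by parts" formula

\[ -\int(\Delta_{\mathcal E}\tilde\varphi_s)\,\tilde\zeta_s\eta\,d\mathfrak m=\mathcal E(\tilde\varphi_s,\tilde\zeta_s\eta)=\int\tilde\zeta_s\Gamma(\tilde\varphi_s,\eta)\,d\mathfrak m+\int\eta\,\Gamma(\tilde\varphi_s,\tilde\zeta_s)\,d\mathfrak m,\qquad\eta\in\mathbb V, \tag{11.11} \]

which holds since $\Delta_{\mathcal E}\tilde\varphi_s,\Gamma(\tilde\varphi_s)\in L^\infty(X,\mathfrak m)$, (11.9) is equivalent to the forward Cauchy problem

\[ \frac{d}{ds}\tilde\zeta_s-t\Delta_{\mathcal E}\tilde\zeta_s-\tilde\zeta_s\Delta_{\mathcal E}\tilde\varphi_s-\Gamma(\tilde\varphi_s,\tilde\zeta_s)=0,\quad s\in[0,1],\qquad\tilde\zeta_0:=\zeta_1\in L^2(X,\mathfrak m), \tag{11.12} \]

which admits the variational formulation

\[ \frac{d}{ds}\int\tilde\zeta_s\eta\,d\mathfrak m+a_s(\tilde\zeta_s,\eta)=0\quad\text{in } (0,1),\ \text{for every } \eta\in\mathbb V, \tag{11.13} \]

where $(a_s)_{s\in[0,1]}$ is the continuous family of bounded bilinear forms in $\mathbb V\times\mathbb V$

\[ a_s(\zeta,\eta):=t\,\mathcal E(\zeta,\eta)+\int\zeta\,\Gamma(\tilde\varphi_s,\eta)\,d\mathfrak m,\qquad\zeta,\eta\in\mathbb V. \tag{11.14} \]

Since $C:=\sup_s\|\Gamma(\tilde\varphi_s)^{1/2}\|_\infty<\infty$, we get

\[ \Big|\int\zeta\,\Gamma(\tilde\varphi_s,\eta)\,d\mathfrak m\Big|\le C\,\|\zeta\|_2\,\mathcal E(\eta)^{1/2} \tag{11.15} \]

and we easily prove that there exist $\lambda,\alpha>0$ (depending on $t$) such that

\[ a_s(\zeta,\zeta)+\lambda\|\zeta\|^2_2\ge\alpha\|\zeta\|^2_{\mathbb V}. \tag{11.16} \]

A (unique) variational solution $\tilde\zeta\in W^{1,2}(0,1;\mathbb V')\cap L^2(0,1;\mathbb V)$ (and therefore continuous with values in $L^2(X,\mathfrak m)$) then follows by applying J.L. Lions' Theorem, see [33, Sect. 4.4, Thm. 4.1]. On the other hand, (11.12) and the uniform $L^\infty$ bound on $\Delta_{\mathcal E}\tilde\varphi_s$ show that

\[ \frac{d}{ds}\tilde\zeta_s-t\Delta_{\mathcal E}\tilde\zeta_s\in L^2(0,1;L^2(X,\mathfrak m)); \]

since $-t\Delta_{\mathcal E}$ is the selfadjoint operator in $L^2(X,\mathfrak m)$ associated to the symmetric Dirichlet form $t\mathcal E$ and since $\tilde\zeta_0\in\mathbb V$, the standard regularity results for variational evolution equations in Hilbert spaces (see, e.g., [16, Chap. III, Sect. 3]) yield $\tilde\zeta\in W^{1,2}(0,1;L^2(X,\mathfrak m))$. Eventually the equation (11.12) provides the $L^2(0,1;D(\Delta_{\mathcal E}))$ regularity.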

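Step 2. Under the same assumptions of the previous step, if $|\Delta_{\mathcal E}\varphi_s|\le D$ $\mathfrak m$-a.e. for every $s\in[0,1]$ and $0<\alpha\le\zeta_1\le\beta<\infty$ $\mathfrak m$-a.e., then

\[ \alpha e^{-D(1-s)}\le\zeta_s\le\beta e^{D(1-s)}\qquad\mathfrak m\text{-a.e. in } X \text{ for every } s\in[0,1]. \tag{11.17} \]

We just observe that for every function $\theta\in C^1([0,1])$ the perturbed solution $\omega_s:=\tilde\zeta_s-\theta_s$ satisfies the equation

\[ \frac{d}{ds}\omega_s-t\Delta_{\mathcal E}\omega_s-\omega_s\Delta_{\mathcal E}\tilde\varphi_s-\Gamma(\tilde\varphi_s,\omega_s)=f_s,\qquad f_s=-\big(\theta'_s-\theta_s\Delta_{\mathcal E}\tilde\varphi_s\big),\quad s\in[0,1], \tag{11.18} \]

which can also be written as

\[ \frac{d}{ds}\int\omega_s\eta\,d\mathfrak m+a_s(\omega_s,\eta)=\int f_s\eta\,d\mathfrak m\qquad\text{for every } \eta\in\mathbb V,\ \text{a.e. in } (0,1). \tag{11.19} \]

Choosing $\theta_s=\beta e^{Ds}$ we get $f_s\le0$ and $\omega_0\le0$. Choosing $\eta_s:=(\omega_s)_+$ in (11.19) and using the Leibniz rule (whose validity can easily be justified in this setting) we get

\[ \frac12\frac{d}{ds}\int\eta_s^2\,d\mathfrak m-\lambda\int\eta_s^2\,d\mathfrak m\le0, \tag{11.20} \]

where we used the fact that

\[ a_s(\omega,\omega_+)=a_s(\omega_+,\omega_+)\ge-\lambda\int(\omega_+)^2\,d\mathfrak m\qquad\text{for every } \omega\in\mathbb V. \]

Since $\eta_0=0$, (11.20) yields $\eta_s=0$ $\mathfrak m$-a.e. in $X$ for every $s$; we thus obtain $\omega_s\le0$ and therefore $\tilde\zeta_s\le\beta e^{Ds}$. The same argument, choosing $\theta_s:=\alpha e^{-Ds}$ and $\eta_s=(\omega_s)_-$, yields the other inequality $\tilde\zeta_s\ge\alpha e^{-Ds}$.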
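The passage from the differential inequality (11.20) to $\eta_s\equiv0$ is a Gronwall argument. A short sketch, with the shorthand $y(s)$ introduced here and assuming (11.20) has the form $\tfrac12y'-\lambda y\le0$:

```latex
% Shorthand (introduced for this sketch): y(s) := \int \eta_s^2 \, d\mathfrak m \ge 0, with y(0) = 0.
% (11.20) is assumed to read  y'(s) \le 2\lambda\, y(s)  for a.e. s.
\[
  \frac{d}{ds}\Big(e^{-2\lambda s}\,y(s)\Big)
  =e^{-2\lambda s}\big(y'(s)-2\lambda\,y(s)\big)\le0
  \quad\Longrightarrow\quad
  0\le y(s)\le e^{2\lambda s}\,y(0)=0\qquad\text{for every } s\in[0,1].
\]
```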

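Step 3: If $\varphi\in C^1([0,1];D_\infty(\Delta_{\mathcal E}))$ is a subsolution to (10.9) then $\psi_s:=t\log\zeta_s$ satisfies

\[ \frac{d}{ds}\mathsf P_{ts}(\varphi_s+\psi_s)+\frac{e^{2Kts}}{2}\,\Gamma\big(\mathsf P_{ts}(\varphi_s+\psi_s)\big)\le0,\qquad s\in[0,1]. \tag{11.21} \]

The transformation $\psi_s:=t\log\zeta_s$ is admissible thanks to the lower and upper bounds proved in the previous step; using the fact that

\[ \Gamma(\varphi_s,\psi_s)=\frac{t}{\zeta_s}\Gamma(\varphi_s,\zeta_s),\qquad\Delta_{\mathcal E}\psi_s+\frac1t\Gamma(\psi_s)=\frac{t}{\zeta_s}\Delta_{\mathcal E}\zeta_s\quad\text{in } L^1(X,\mathfrak m), \]

we obtain (11.8); notice that $\psi\in W^{1,2}(0,1;L^2(X,\mathfrak m))\cap C^0([0,1];\mathbb V)$ and $\Delta_{\mathcal E}\psi\in L^1(0,1;L^1(X,\mathfrak m))$.

Since $\varphi$ is a subsolution to (10.9) we get

\[ \frac{d}{ds}(\varphi_s+\psi_s)+t\Delta_{\mathcal E}(\varphi_s+\psi_s)+\frac12\Gamma(\varphi_s+\psi_s)\le0,\qquad s\in[0,1]; \tag{11.22} \]

applying the positivity preserving $\mathsf P_{ts}$ and observing that

\[ \frac{d}{ds}\mathsf P_{ts}\phi_s=t\Delta_{\mathcal E}\mathsf P_{ts}\phi_s+\mathsf P_{ts}\Big(\frac{d}{ds}\phi_s\Big)=\mathsf P_{ts}\Big(t\Delta_{\mathcal E}\phi_s+\frac{d}{ds}\phi_s\Big) \]

whenever $\phi\in W^{1,2}(0,1;L^2(X,\mathfrak m))$ with $\Delta_{\mathcal E}\phi\in L^1(0,1;L^1(X,\mathfrak m))$, we get

\[ \frac{d}{ds}\mathsf P_{ts}(\varphi_s+\psi_s)+\frac12\mathsf P_{ts}\Gamma(\varphi_s+\psi_s)\le0,\qquad s\in[0,1], \tag{11.23} \]

which yields (11.21) by the $\mathrm{BE}(K,\infty)$ gradient commutation property. □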

The following corollary is a direct consequence of (11.2), see also the metric regularization 
estimate (3.6). 

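Corollary 11.2 ($L\log L$ regularization). For any $\mu=\rho\,\mathfrak m\in\mathscr P^a(X)$ and $\sigma\in D(\mathrm{Ent})$ we have:

\[ \mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)\le\mathrm{Ent}(\sigma)+\frac{K}{e^{2Kt}-1}W^2_{\mathcal E,*}(\mu,\sigma)\qquad\forall t>0. \tag{11.24} \]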
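The constant in (11.24) can be traced back to the integrated inequality. The sketch below assumes that (11.2) reads $\tfrac12W^2_{\mathcal E,*}(\mathsf P_t\rho\,\mathfrak m,\sigma)+t\,\mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)\le\tfrac{t}{2I_{2K}(t)}W^2_{\mathcal E,*}(\rho\,\mathfrak m,\sigma)+t\,\mathrm{Ent}(\sigma)$ with $I_{2K}(t)=\int_0^te^{2Ks}\,ds$; both the form of (11.2) and the notation $I_{2K}$ are assumptions of this example.

```latex
% Drop the nonnegative term \tfrac12 W^2(P_t\rho m,\sigma) on the left of (11.2)
% and divide by t > 0:
\[
  \mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)
  \le\mathrm{Ent}(\sigma)+\frac{1}{2I_{2K}(t)}\,W^2_{\mathcal E,*}(\mu,\sigma),
  \qquad
  \frac{1}{2I_{2K}(t)}=\frac{1}{2}\cdot\frac{2K}{e^{2Kt}-1}=\frac{K}{e^{2Kt}-1}\quad(K\neq0).
\]
% For K -> 0 the coefficient tends to 1/(2t).
```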

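The following corollary is also a direct consequence of Theorem 3.5 and of the fact that $W_{\mathcal E}$ is the upper length distance induced by $W_{\mathcal E,*}$.

Corollary 11.3 ($\mathsf P^{\mathcal E}$ satisfies EVI$_K$ relative to $W_{\mathcal E}$). For all $\mu=\rho\,\mathfrak m\in\mathscr P^a(X)$, $\sigma\in D(\mathrm{Ent})$ with $W_{\mathcal E}(\mu,\sigma)<\infty$ one has $\mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)<\infty$, $W_{\mathcal E}(\mathsf P_t\rho\,\mathfrak m,\sigma)<\infty$ for all $t>0$ and

\[ \frac{d^+}{dt}\frac12W^2_{\mathcal E}(\mathsf P_t\rho\,\mathfrak m,\sigma)+\frac K2W^2_{\mathcal E}(\mathsf P_t\rho\,\mathfrak m,\sigma)\le\mathrm{Ent}(\sigma)-\mathrm{Ent}(\mathsf P_t\rho\,\mathfrak m)\qquad\forall t\ge0. \tag{11.25} \]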

We can now obtain the geodesic property of $D(\mathrm{Ent})$ and the convexity of $\mathrm{Ent}$, relative to $W_{\mathcal E}$. This provides a link with the theory developed independently by Lott-Villani and Sturm of synthetic lower bounds on the Ricci tensor, based on convexity properties of $\mathrm{Ent}$ (see [46]).

Theorem 11.4 (Geodesic convexity of the entropy functional). $(D(\mathrm{Ent}),W_{\mathcal E})$ is an extended geodesic metric space: for every couple of measures $\mu,\nu\in D(\mathrm{Ent})$ with $W_{\mathcal E}(\mu,\nu)<\infty$ there exists a $W_{\mathcal E}$-Lipschitz curve $\mu_t:[0,1]\to D(\mathrm{Ent})$ such that

\[ \mu_0=\mu,\quad\mu_1=\nu,\quad W_{\mathcal E}(\mu_s,\mu_t)=|t-s|\,W_{\mathcal E}(\mu,\nu),\qquad s,t\in[0,1]. \tag{11.26} \]

In addition, the "finitary" length distance generated by $W_{\mathcal E,*}$ according to (3.8) coincides with $W_{\mathcal E}$ on $D(\mathrm{Ent})\times D(\mathrm{Ent})$ and $\mathrm{Ent}$ is $K$-convex on every curve as in (11.26):

\[ \mathrm{Ent}(\mu_t)\le(1-t)\,\mathrm{Ent}(\mu_0)+t\,\mathrm{Ent}(\mu_1)-\frac K2t(1-t)\,W^2_{\mathcal E}(\mu,\nu). \tag{11.27} \]

Finally, for all $\mu=\rho\,\mathfrak m\in D(\mathrm{Ent})$ the slope of the entropy coincides with the Fisher information:

\[ 4\,\mathcal E(\sqrt\rho)=|\mathrm D^-_{W_{\mathcal E}}\mathrm{Ent}|^2(\rho\,\mathfrak m). \tag{11.28} \]

Proof. We are going to apply Corollary 3.6 with $X = D(\mathrm{Ent})$, $d = W_{\mathcal E,*}$, $F = \mathrm{Ent}$ and $S = \mathsf P$ (we identify here measures with probability densities, as usual). We know from Theorem 11.1 that $S$ provides an $\mathrm{EVI}_K$-gradient flow of $F$ in $X$, hence $(\mathcal P^a(X), W_{\mathcal E,*,\ell})$ (where $W_{\mathcal E,*,\ell}$ is the length distance associated to $W_{\mathcal E,*}$ as in (3.8)) is a length space and the same holds for all sublevels $\{\mathrm{Ent} \le c\}$, $c \in [0,\infty)$. On the other hand, since the sublevels are compact w.r.t. the weak $L^1(X,\mathfrak m)$ topology (thanks to the Dunford-Pettis theorem), we immediately obtain that $(\{\mathrm{Ent} \le c\}, W_{\mathcal E,*})$ are complete, thanks to the lower semicontinuity of $W_{\mathcal E,*}$ w.r.t. the weak $L^1(X,\mathfrak m)$ convergence. It follows by Corollary 3.6 that on $D(\mathrm{Ent}) \times D(\mathrm{Ent})$ the distance $W_{\mathcal E,*,\ell}$ coincides with the upper length distance induced by $W_{\mathcal E,*}$, namely $W_{\mathcal E}$ (by Proposition 10.11).

Thanks to compactness, the length properties of the sublevels can be improved to geodesic properties by the remarks made after Corollary 3.6. We can now use Corollary 11.3 to improve the $\mathrm{EVI}_K$ property from $W_{\mathcal E,*}$ to $W_{\mathcal E}$, getting then the convexity of $\mathrm{Ent}$ along geodesics of $W_{\mathcal E}$.
We need only to prove the inequality $\ge$ in (11.28), since the converse inequality can be proved independently of curvature assumptions, see the proof of Lemma 8.1 in the metric setup and recall (10.10). We start from the observation that for any $\rho \in L_+(X,\mathfrak m)$ one has

\[ 4\,\mathcal E\big(\sqrt{\mathsf P^{\mathcal E}_t\rho}\big) = |\mathrm D^-_{W_{\mathcal E}} \mathrm{Ent}|^2(\mathsf P^{\mathcal E}_t\rho\,\mathfrak m) \quad \text{for a.e. } t > 0 \]

by looking at the energy dissipation rates from the $L^2$ point of view and from the $W_{\mathcal E}$ point of view (the latter is derived from the $\mathrm{EVI}_K$ property). Now, if $\rho \in \mathbb V$ with $\inf\rho > 0$ we can pass to the limit as $t \downarrow 0$ along a suitable sequence and use the lower semicontinuity of $|\mathrm D^-_{W_{\mathcal E}} \mathrm{Ent}|$ w.r.t. $W_{\mathcal E}$ convergence (derived from the convexity of the entropy) to obtain the inequality $\ge$ in (11.28). For general probability densities we argue by truncation, using once more the lower semicontinuity of $|\mathrm D^-_{W_{\mathcal E}} \mathrm{Ent}|$. □

We conclude by pointing out some standard consequences of the $K$-convexity of $\mathrm{Ent}$.


Corollary 11.5 (Convexity of bounded densities, Log-Sobolev and transport inequalities).

(i) If $K \ge 0$, then the sets $\{\mu = \rho\mathfrak m \in \mathcal P^a(X) : \|\rho\|_\infty \le c\}$ are geodesically convex w.r.t. $W_{\mathcal E}$, i.e. every couple of measures $\mu_i = \rho_i\mathfrak m$, $i = 0,1$, with $\|\rho_i\|_\infty \le c$ and $W_{\mathcal E}(\mu_0,\mu_1) < \infty$, can be connected by a geodesic $\mu_t = \rho_t\mathfrak m$ as in (11.26) such that $\|\rho_t\|_\infty \le c$ for every $t \in [0,1]$.

(ii) If $K > 0$ and $\mathcal E$ is irreducible according to (10.17), then the log-Sobolev inequality (10.13) holds with $c_{LS} = K$. In particular, thanks to Lemma 10.7, one has the Talagrand inequality

\[ \frac K2\, W_{\mathcal E}^2(\mu,\mathfrak m) \le \mathrm{Ent}(\mu) \qquad \forall \mu \in \mathcal P^a(X). \tag{11.29} \]


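Proof. For the geodesic convexity of the sets $\{\mu = \rho\mathfrak m \in \mathcal P^a(X) : \|\rho\|_\infty \le c\}$ the rescaling of transport plans as in [4, Prop. 3.3] applies (notice that the assumptions on the size of the supports made therein play a role only when $K < 0$).

The metric argument of [2, Lem. 2.4.13], relying on $K$-convexity, can be applied to give

\[ \mathrm{Ent}(\rho_1\mathfrak m) - \mathrm{Ent}(\rho_0\mathfrak m) \le \frac{1}{2K}\, |\mathrm D^-_{W_{\mathcal E}} \mathrm{Ent}|^2(\rho_1\mathfrak m), \]

for any pair of measures $\mu_0 = \rho_0\mathfrak m$, $\mu_1 = \rho_1\mathfrak m$ in $D(\mathrm{Ent})$ with $W_{\mathcal E}(\mu_0,\mu_1) < \infty$. If $\rho \in D(\mathrm{Ent})$ we apply this inequality with $\rho_0 = \mathsf P_t\rho$ and $\rho_1 = \rho$ and we let $t \to \infty$ to get, by the irreducibility of $\mathcal E$ and (10.18),

\[ \mathrm{Ent}(\rho\mathfrak m) \le \frac{1}{2K}\, |\mathrm D^-_{W_{\mathcal E}} \mathrm{Ent}|^2(\rho\mathfrak m). \]

Using (11.28) we obtain the log-Sobolev inequality. □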
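As a sanity check on the Talagrand inequality (11.29), one can test it in the simplest smooth model, which is an assumption of this sketch and not a case treated above: $X = \mathbb R$, $\mathfrak m$ the standard Gaussian, $\mathcal E(f) = \int |f'|^2\, d\mathfrak m$, so that $K = 1$ and $W_{\mathcal E}$ is taken to coincide with the usual quadratic Wasserstein distance. For Gaussian measures $N(a, s^2)$ both sides have closed forms. The function names below are hypothetical.

```python
import math

def entropy_vs_standard_gaussian(a: float, s: float) -> float:
    """Relative entropy Ent(N(a, s^2) | N(0, 1)) in closed form."""
    return 0.5 * (s * s + a * a - 1.0) - math.log(s)

def w2_squared_to_standard_gaussian(a: float, s: float) -> float:
    """Squared quadratic Wasserstein distance between N(a, s^2) and N(0, 1) on the real line."""
    return a * a + (s - 1.0) ** 2

def talagrand_gap(a: float, s: float, K: float = 1.0) -> float:
    """Ent(mu) - (K/2) W^2(mu, m); for K = 1 the inequality predicts a nonnegative value."""
    return entropy_vs_standard_gaussian(a, s) - 0.5 * K * w2_squared_to_standard_gaussian(a, s)

if __name__ == "__main__":
    for a, s in [(0.0, 1.0), (2.0, 1.0), (0.5, 0.3), (-1.0, 3.0)]:
        print(a, s, talagrand_gap(a, s))
```

In this model the gap equals $(s - 1) - \ln s$, so pure translations ($s = 1$) saturate the inequality, while any change of variance leaves a strictly positive gap.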


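Corollary 11.6. The transport inequality (11.29) implies that the class of measures in $\mathcal P^a(X)$ with bounded density w.r.t. $\mathfrak m$ is dense in $D(\mathrm{Ent})$ w.r.t. $W_{\mathcal E}$.

Proof. Let us approximate $\mu = \rho\mathfrak m$ by the measures $\mu^k := 1_{\{\rho \le k\}}\mu / a_k$, where $a_k = \int_{\{\rho \le k\}} \rho\, d\mathfrak m \uparrow 1$ are the normalization constants. If $\rho^k$ are the densities of $\mu^k$, writing $b_k = 1 - a_k = \int_{\{\rho > k\}} \rho\, d\mathfrak m$ and

\[ \rho^k = 1_{\{\rho \le k\}}\rho + \frac{b_k}{a_k}\, 1_{\{\rho \le k\}}\rho, \qquad \rho = 1_{\{\rho \le k\}}\rho + 1_{\{\rho > k\}}\rho, \]

adding the constant term $1_{\{\rho \le k\}}\rho$ to the solutions to the continuity inequality we get

\[ W_{\mathcal E}^2(\mu^k,\mu) \le b_k\, W_{\mathcal E}^2\big(\mu^k,\, b_k^{-1} 1_{\{\rho > k\}}\rho\,\mathfrak m\big). \]

It suffices then to show that $\lim_k b_k\, W_{\mathcal E}^2\big(\mu^k, b_k^{-1} 1_{\{\rho > k\}}\rho\,\mathfrak m\big) = 0$. To this aim, we compare both measures with $\mathfrak m$. The transport inequality then gives

\[ b_k\, W_{\mathcal E}^2(\mu^k,\mathfrak m) \le \frac{2 b_k}{K}\, \mathrm{Ent}(\mu^k) \to 0 \]

and

\[ b_k\, W_{\mathcal E}^2\big(b_k^{-1} 1_{\{\rho > k\}}\rho\,\mathfrak m,\, \mathfrak m\big) \le \frac 2K \Big[ \int_{\{\rho > k\}} \rho \ln\rho\, d\mathfrak m + b_k \ln\Big(\frac{1}{b_k}\Big) \Big]. \tag{11.30} \]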

□ 
12. From differentiable to metric structures and conversely

12.1. Energy measure spaces induce extended metric measure spaces. In this section $(X, \mathcal B, \mathcal E, \mathfrak m)$ is an energy measure space according to Definition 10.1; following the construction explained in Section 4, page 13, we are going to introduce an extended metric-topological structure starting from a given family $\mathcal L$ of pointwise defined real functions such that

\[ \mathcal L \subset \{ f : X \to \mathbb R : f \text{ is } \mathcal B\text{-measurable and bounded},\ \Gamma(f) \le 1 \}, \qquad \mathcal L \text{ separates points of } X, \tag{12.1a} \]

so that (equivalence $\mathfrak m$-a.e. classes of) elements of $\mathcal L$ belong to $\mathcal A_{\mathcal E}$. Then

\[ \tau \text{ is the Hausdorff topology in } X \text{ generated by } \mathcal L, \tag{12.1b} \]

i.e. $\tau$ is the coarsest topology such that all the functions of $\mathcal L$ are continuous: $(X,\tau)$ is automatically completely regular. Restricting $\mathfrak m$ to $\mathcal B(\tau) \subset \mathcal B$, we will assume that

\[ \mathfrak m \in \mathcal P(X) \ (\text{i.e. } \mathfrak m \text{ is Radon in } \mathcal B(\tau)), \qquad \operatorname{supp} \mathfrak m = X; \tag{12.1c} \]

we can then consider the class

\[ \mathcal L_{\mathcal E} := \{ f \in \mathcal A_{\mathcal E} \cap C_b(X,\tau) : \Gamma(f) \le 1 \} \tag{12.1d} \]

(where we identify functions in $\mathcal A_{\mathcal E}$ with their unique $\tau$-continuous representative) containing $\mathcal L$ and use $\mathcal L_{\mathcal E}$ to define canonically $d_{\mathcal E} : X \times X \to [0,\infty]$ by

\[ d_{\mathcal E}(x,y) := \sup\{ |f(x) - f(y)| : f \in \mathcal L_{\mathcal E} \} \tag{12.1e} \]

so that $(X, d_{\mathcal E})$ is an extended metric space.

In addition, if $I$ is the collection of finite subsets of $\mathcal L_{\mathcal E}$, for $i \in I$ we define

\[ d_i(x,y) := \sup_{f \in i} |f(x) - f(y)|. \tag{12.2} \]

Notice that $d_i$ is only a semidistance, i.e. it is symmetric and it satisfies the triangle inequality. We shall also use the fact that $d_i(\cdot,y) \in \mathcal A_{\mathcal E}$ for all $y \in X$, with $\Gamma(d_i(\cdot,y)) \le 1$ $\mathfrak m$-a.e. in $X$. If we endow $X$ with the semidistance $d_i$ it is immediately seen that $(X, d_i)$ is separable.

We thus get that $(X, \tau, d_{\mathcal E})$ is an extended metric-topological space according to Definition 4.1 and that $(X, \tau, d_{\mathcal E}, \mathfrak m)$ is an extended metric measure space according to Definition 4.7.

Remark 12.1. The typical case of this construction occurs when $\operatorname{supp}\mathfrak m = X$ and $\mathcal L$ can be identified with a subset of $\mathcal A_{\mathcal E} \cap C_b(X,\tau_0)$ for some preexisting topology $\tau_0$ in $X$; in this case the condition $\operatorname{supp}\mathfrak m = X$ provides uniqueness of the continuous representative and $\tau$ is coarser than $\tau_0$, so that (12.1c) is satisfied if $\mathfrak m \in \mathcal P(X,\tau_0)$. Notice that $\tau = \tau_0$ if $\tau_0$ is generated by $\mathcal L$.

Proposition 12.2. Under assumptions (12.1a), (12.1c) one has $W_{\mathcal E} \ge W_{d_{\mathcal E}}$, where $W_{\mathcal E}$ is defined in (10.8).

Proof. Let $d_i$ be as above. By Theorem 5.1 we need only to show that $W_{d_i} \le W_{\mathcal E,*}$. In order to prove this property, taking (5.3) and the comments immediately after into account, it suffices to show that

\[ Q^i_t\phi(x) := \inf_{y \in X} \phi(y) + \frac{1}{2t}\, d_i^2(x,y) \]

is admissible in (7.10) whenever $\phi$ is bounded and $d_i$-Lipschitz. By applying Lemma 12.3 below to $Q^i_t\phi(\cdot)$ we get $\Gamma(Q^i_t\phi(\cdot)) \le |\mathrm D_i Q^i_t\phi(\cdot)|^2$ $\mathfrak m$-a.e. in $X$, where $|\mathrm D_i f|$ denotes the slope w.r.t. $d_i$. Taking into account the subsolution property (7.7) of $Q^i_t\phi$, we obtain $\partial_t Q^i_t\phi + \Gamma(Q^i_t\phi)/2 \le 0$. □

Let $d$ be a finite semidistance in $X$. In the proof of the next lemma we are going to use in $(X,d)$ the following links between the descending slope in (3.4) computed w.r.t. $d$ and the functions $Q_t f$, $f \in \mathrm{Lip}(X,d)$, provided by the Hopf-Lax formula

\[ Q_t f(x) := \inf_{y \in X} f(y) + \frac{1}{2t}\, d^2(x,y), \tag{12.3} \]

see [5, Sec. 3] for the proof (see also [2, Lem. 3.1.5]):

\[ |\mathrm D f|^2(x) \ge |\mathrm D^- f|^2(x) = \limsup_{t \downarrow 0} \int_0^1 \Big( \frac{D^+ f(x,tr)}{tr} \Big)^2 dr. \tag{12.4} \]

Here, $D^+ f(x,t)$ is defined by

\[ D^+ f(x,t) := \sup\Big\{ \limsup_{n \to \infty} d(x_n,x) : (x_n) \text{ minimizing sequence in (12.3)} \Big\}. \]

It is not hard to show, by diagonal arguments, that $D^+ f$ is upper semicontinuous in $X \times (0,\infty)$, endowed with the product of $d$ and of the Euclidean distance (see again [5, Sec. 3]). These results are stated in [5] for metric spaces, and they can be immediately adapted to the degenerate space $(X,d)$, just noticing that $d(x,y) = 0$ implies $f(x) = f(y)$, i.e. lifting them from the quotient metric space to $(X,d)$.
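The Hopf-Lax formula is easy to experiment with numerically. The following minimal sketch assumes the model case $X = \mathbb R$ with $d(x,y) = |x - y|$ and replaces the infimum over $X$ by a minimum over a finite grid; the helper names `hopf_lax` and `grid` are hypothetical. For $f(y) = |y|$ the exact value is $Q_t f(x) = |x| - t/2$ when $|x| \ge t$, which the grid minimum reproduces up to discretization error. Taking the minimum over a growing nested family of points gives a nonincreasing sequence of approximations.

```python
def hopf_lax(f, points, t, x):
    """Approximate Q_t f(x) = inf_y f(y) + |x - y|^2 / (2t) by a minimum over `points`.

    Returns the minimal value and a minimizing point.
    """
    best_y = min(points, key=lambda y: f(y) + (x - y) ** 2 / (2.0 * t))
    return f(best_y) + (x - best_y) ** 2 / (2.0 * t), best_y

def grid(lo, hi, n):
    """n + 1 equally spaced points in [lo, hi]."""
    return [lo + (hi - lo) * i / n for i in range(n + 1)]

if __name__ == "__main__":
    t, x = 0.5, 1.0
    for n in (4, 40, 400):
        value, y = hopf_lax(abs, grid(-2.0, 2.0, n), t, x)
        # |x - y| is the displacement of the (approximate) minimizer from x.
        print(n, value, y, abs(x - y))
```

The displacement $|x - y|$ of the minimizer is the discrete counterpart of the quantity $D^+ f(x,t)$ used in the slope estimate.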

Lemma 12.3. Let $d$ be a bounded $(\tau \times \tau)$-continuous semidistance in $X$ with $(X,d)$ separable, $d(\cdot,y) \in \mathbb V$ and $\Gamma(d(\cdot,y)) \le 1$ $\mathfrak m$-a.e. in $X$ for all $y \in X$. Then, for all $f : X \to \mathbb R$ bounded and $d$-Lipschitz, denoting by $|\mathrm D f|$ the slope w.r.t. $d$, one has $\Gamma(f) \le |\mathrm D f|^2$ $\mathfrak m$-a.e. in $X$.

Proof. In the proof, which follows closely [5, Lem. 3.12], we will use the following weak stability property of the $\Gamma$ operator, which follows easily by Mazur’s lemma: if $f_n \in \mathbb V$ and $f_n \to f$ in $L^2(X,\mathfrak m)$, then

\[ \sqrt{\Gamma(f_n)} \rightharpoonup G \text{ weakly in } L^2(X,\mathfrak m) \quad \text{implies} \quad G \ge \sqrt{\Gamma(f)} \ \ \mathfrak m\text{-a.e. in } X. \tag{12.5} \]

If $(z_i)$ is a countable $d$-dense subset of $X$ we define

\[ Q^n_t f(x) = \min_{1 \le i \le n} f(z_i) + \frac{1}{2t}\, d^2(z_i,x), \qquad Q_t f(x) = \min_{y \in X} f(y) + \frac{1}{2t}\, d^2(y,x), \tag{12.6} \]

and we set $I_n(x) := \{ i \in \{1,\dots,n\} : z_i \text{ minimizes (12.6)} \}$. By the density of $(z_i)$, it is clear that $Q^n_t f \downarrow Q_t f$ as $n \to \infty$. Therefore, if $\zeta_n(x) = z_i$ for some $i \in I_n(x)$, it turns out that $(\zeta_n(x))$ is a minimizing sequence for $Q_t f(x)$, namely

\[ \frac{1}{2t}\, d^2(x,\zeta_n(x)) + f(\zeta_n(x)) \to Q_t f(x) \quad \text{as } n \to \infty. \]

The very definition of $D^+ f(x,t)$ then gives

\[ \limsup_{n \to \infty} d(x,\zeta_n(x)) \le D^+ f(x,t). \tag{12.7} \]

Since $Q^n_t f(x) = f(z_i) + d^2(z_i,x)/2t$ on $\{ x : \zeta_n(x) = z_i \}$, the locality property and the fact that $d(z_i,\cdot)$ belongs to $\mathcal A_{\mathcal E}$ together with the chain rule yield

\[ \Gamma(Q^n_t f)(x) \le \frac{1}{t^2} \max_{i \in I_n(x)} d^2(x,z_i) \quad \text{for } \mathfrak m\text{-a.e. } x \in \{ \zeta_n = z_i \}. \]

If we define $\zeta_n(x)$ as the point $z_j$ with the smallest index $j$ among those that realize the maximum for $d(z_i,x)$, $i \in I_n(x)$, the previous formula yields

\[ \Gamma(Q^n_t f)(x) \le \frac{1}{t^2}\, d^2(x,\zeta_n(x)) \quad \text{for } \mathfrak m\text{-a.e. } x \in X. \tag{12.8} \]

Since $Q^n_t f$ and $\Gamma(Q^n_t f)$ are uniformly bounded and $Q^n_t f$ converges pointwise to $Q_t f$, considering any weak limit point $G$ of $\sqrt{\Gamma(Q^n_t f)}$ in $L^2(X,\mathfrak m)$ we obtain by (12.5), (12.7) and (12.8) that

\[ \Gamma(Q_t f)(x) \le G^2(x) \le \frac{(D^+ f(x,t))^2}{t^2} \quad \text{for } \mathfrak m\text{-a.e. } x \in X. \tag{12.9} \]

Since $f$ is Lipschitz, it follows that $D^+ f(x,t)/t$ is uniformly bounded and since $d$ is $(\tau \times \tau)$-continuous the function $D^+ f$ is Borel in $X \times (0,\infty)$. Integrating (12.4) on an arbitrary Borel set $A$ and applying Fatou’s Lemma, from (12.9) we get

\[
\begin{aligned}
\int_A |\mathrm D f|^2\, d\mathfrak m
&\ge \int_A \limsup_{t \downarrow 0} \int_0^1 \Big( \frac{D^+ f(x,tr)}{tr} \Big)^2 dr\, d\mathfrak m(x) \\
&\ge \limsup_{t \downarrow 0} \int_0^1 \int_A \Big( \frac{D^+ f(x,tr)}{tr} \Big)^2 d\mathfrak m(x)\, dr \\
&\ge \limsup_{t \downarrow 0} \int_0^1 \int_A \Gamma(Q_{tr} f)(x)\, d\mathfrak m(x)\, dr \\
&\ge \int_0^1 \liminf_{t \downarrow 0} \Big( \int_A \Gamma(Q_{tr} f)\, d\mathfrak m \Big) dr \ \ge \int_A \Gamma(f)\, d\mathfrak m,
\end{aligned}
\]

where in the last inequality we applied (12.5) once more. Since $A$ is arbitrary we conclude. □


The $\tau$-upper regularity has already been identified in [5] as a crucial compatibility condition between the topological and the metric/differentiable structures, needed to identify $\mathcal E$ with a Cheeger energy.

Definition 12.4 ($\tau$-upper regularity). Let $(X, \mathcal B, \mathcal E, \mathfrak m)$ be an energy measure space with $\mathcal B(\tau) \subset \mathcal B$ for some topology $\tau$ in $X$. We say that $\mathcal E$ is $\tau$-upper regular if for all $f \in \mathbb V$ there exist:

(a) functions $f_n \in \mathrm{Lip}_b(X,d,\tau)$ with $f_n \to f$ in $L^2(X,\mathfrak m)$;

(b) bounded $\tau$-upper semicontinuous functions $g_n$ with $g_n \ge \sqrt{\Gamma(f_n)}$ $\mathfrak m$-a.e. in $X$ with $\limsup_n \int g_n^2\, d\mathfrak m \le \mathcal E(f)$.

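Theorem 12.5. Let $(X, \mathcal B, \mathcal E, \mathfrak m)$ be an energy measure space and let $(X, \tau, d_{\mathcal E}, \mathfrak m)$ be the extended metric-topological structure associated to a set $\mathcal L$ as in (12.1a,b,c,d,e). Then $\mathcal E \le \mathrm{Ch}_{d_{\mathcal E}}$ and, in particular, $W_{\mathcal E} \ge W_{\mathrm{Ch}_{d_{\mathcal E}}}$. The equality $\mathcal E = \mathrm{Ch}_{d_{\mathcal E}}$ holds iff $\mathcal E$ is $\tau$-upper regular. In particular, if $\mathcal E$ is $\tau$-upper regular, one has:

(a) the classes of 2-absolutely continuous curves $\mu_t = \rho_t\mathfrak m$ w.r.t. $W_{\mathcal E}$, $W_{\mathcal E,*}$ and $W_{d_{\mathcal E}}$ with $\rho_t \in L^\infty(L^\infty(X,\mathfrak m))$ coincide and the same is true for the corresponding metric derivatives;

(b) if $\mathrm{BE}(K,\infty)$ holds, the metric gradient flows of $\mathrm{Ent}$ w.r.t. $W_{\mathcal E}$, $W_{\mathcal E,*}$ and $W_{d_{\mathcal E}}$ coincide with $\mathsf P^{\mathcal E}_t$;

(c) if $\mathrm{BE}(K,\infty)$ holds and $\mathsf P^{\mathcal E}_t$ maps $C_b(X)$ in $C_b(X)$, then $W_{\mathcal E}$ is the upper length distance in $\mathcal P^a(X)$ associated to $W_{d_{\mathcal E}}$ according to (3.9);

(d) if $\mathrm{BE}(K,\infty)$ holds with $K \ge 0$ (resp. $K > 0$ and $\mathsf P$ is irreducible), then $W_{\mathcal E}$ restricted to $\{\mu = \rho\mathfrak m : \|\rho\|_\infty \le c\}$ (resp. $D(\mathrm{Ent})$) is the upper length distance in $\mathcal P^a(X)$ associated to $W_{d_{\mathcal E}}$ according to (3.9).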

Proof. We prove first the inequality $\mathcal E \le \mathrm{Ch}_{d_{\mathcal E}}$. Taking Theorem 9.1 and the lower semicontinuity of $\mathcal E$ into account, it suffices to show that $\mathcal E \le \mathrm{Ch}_i$, where $\mathrm{Ch}_i$ is the Cheeger energy associated to the semimetric measure space $(X, \tau, d_i, \mathfrak m)$ and $d_i$ is the monotone approximation of $d_{\mathcal E}$ illustrated by (12.2). By Lemma 12.3 we obtain

\[ \mathcal E(f) \le \int |\mathrm D_i f|^2\, d\mathfrak m \]

for all $f : X \to \mathbb R$ bounded and $d_i$-Lipschitz, where $|\mathrm D_i f|$ denotes the slope w.r.t. $d_i$. Using Proposition 6.3(b) and once more the lower semicontinuity of $\mathcal E$ we conclude.

The necessity of $\tau$-upper regularity for the validity of the equality $\mathcal E = \mathrm{Ch}_{d_{\mathcal E}}$ follows by applying Theorem 9.2 to the $(\tau \times \tau)$-continuous semidistances $d_i$ which monotonically converge to $d_{\mathcal E}$: one obtains the $\tau$-upper regularity (along a subnet $i = \beta(j)$) with $f_j \in \mathrm{Lip}(X, \tau, d_j)$ and with the $d_j$-upper semicontinuous (and thus also $\tau$-upper semicontinuous) functions $g_j = \mathrm{Lip}_a(f_j, d_j, \cdot)$.

For the sufficiency of $\tau$-upper regularity we follow the argument in [5, Prop. 3.11]. Thanks to the $\tau$-upper regularity, in order to prove that $\mathrm{Ch}_{d_{\mathcal E}} \le \mathcal E$ it is sufficient to show that $f \in D(\mathrm{Ch}_{d_{\mathcal E}})$ and that $|\mathrm D f|_{w,d_{\mathcal E}} \le g$ $\mathfrak m$-a.e. in $X$ whenever $f \in \mathrm{Lip}(X, \tau, d_{\mathcal E})$ and $g$ is a bounded $\tau$-upper semicontinuous function such that $g \ge \sqrt{\Gamma(f)}$ $\mathfrak m$-a.e. in $X$. By the very definition of $\mathrm{Ch}_{d_{\mathcal E}}$, $f \in D(\mathrm{Ch}_{d_{\mathcal E}})$. In order to prove the inequality $|\mathrm D f|_{w,d_{\mathcal E}} \le g$ $\mathfrak m$-a.e. in $X$ we will prove the inequality for the slope $|\mathrm D f|_{d_{\mathcal E}}$. We need only, thanks to the upper semicontinuity of $g$, to prove the pointwise inequality $|\mathrm D f|_{d_{\mathcal E}} \le c$ in the $\tau$-open set $U := \{g < c\}$. By homogeneity, we can assume $c = 1$ and we fix $x_0 \in U$; since $\tau$ is generated by $\mathcal L$ we can find a finite collection $(f_n)_{n=1}^N$ of elements of $\mathcal L$ and $r > 0$ such that

\[ F := \Big\{ x \in X : \max_{1 \le n \le N} |f_n(x) - f_n(x_0)| \le r \Big\} \subset U. \]

Set

\[ \delta(x) := \max_{1 \le n \le N} |f_n(x) - f_n(x_0)|, \qquad l(x) := \min\{ r, |f(x) - f(x_0)| \}, \qquad h(x) := \max\{ l(x), \delta(x) \}, \]

and notice that $\delta \in \mathcal L_{\mathcal E}$. Since $\{h = l\} = \{l \ge \delta\} \subset F \subset U$ and since $\Gamma(l) \le 1$ in $U$, by locality we get $\Gamma(h) \le 1$ $\mathfrak m$-a.e. in $X$; since $h$ is $\tau$-continuous we get $h \in \mathcal L_{\mathcal E}$, so that (12.1e) yields

\[ h(x) = h(x) - h(x_0) \le d_{\mathcal E}(x,x_0). \]

Since the topology induced by $d_{\mathcal E}$ is stronger than $\tau$, the $\tau$-continuity of $f$ gives $h(x) \ge |f(x) - f(x_0)|$ for $d_{\mathcal E}(x,x_0)$ sufficiently small. It follows that $|\mathrm D f|_{d_{\mathcal E}}(x_0) \le 1$.

Finally, statements (a) and (b) follow by Corollary 7.5 and Theorem 8.3 of the metric theory, taking also the inequalities $W_{d_{\mathcal E}} \le W_{\mathcal E,*} \le W_{\mathcal E}$ into account.

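Let us prove now statement (c). It suffices to show that $W_{\mathcal E}^2(\rho_0\mathfrak m, \rho_1\mathfrak m) \le \int_0^1 |\dot\rho_s|^2\, ds$ for any absolutely continuous curve $\mu_t = \rho_t\mathfrak m$ w.r.t. $W_{d_{\mathcal E}}$ contained in $\mathcal P^a(X)$, where $|\dot\rho_t|$ denotes the metric derivative w.r.t. $W_{d_{\mathcal E}}$. Since $\mathcal E$ is $\tau$-upper regular, we can identify $\mathcal E$ with $\mathrm{Ch}_{d_{\mathcal E}}$ and $\Gamma(f)$ with $|\mathrm D f|_w^2$. By the definition of $W_{\mathcal E}$, it will be sufficient to prove the inequality

\[ \int \phi\,\rho_{s_1}\, d\mathfrak m - \int \phi\,\rho_{s_2}\, d\mathfrak m \le \int_{s_1}^{s_2} \Big( \int \Gamma(\phi)\,\rho_s\, d\mathfrak m \Big)^{1/2} |\dot\rho_s|\, ds \tag{12.10} \]

for all $\phi \in \mathcal A_{\mathcal E}$ and $0 \le s_1 < s_2 \le 1$.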

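We start from the observation that for any $\psi \in \mathcal A_{\mathcal E} \cap C_b(X)$ and any bounded $\tau$-upper semicontinuous function $g \ge \sqrt{\Gamma(\psi)}$ $\mathfrak m$-a.e. in $X$ the Lipschitz property of $\psi$ w.r.t. $d_{\mathcal E}$ can be “localized” as in the first part of the proof of the theorem to get that $\psi \circ \eta$ is absolutely continuous in $[0,1]$ for all $\eta \in AC([0,1]; (X, d_{\mathcal E}))$ with $|(\psi \circ \eta)'| \le g(\eta)\,|\dot\eta|$ a.e. in $(0,1)$. By integrating this inequality along a test plan $\boldsymbol\eta$ representing the curve $\mu_t$ we get

\[ \int \psi\,\rho_{s_1}\, d\mathfrak m - \int \psi\,\rho_{s_2}\, d\mathfrak m \le \int_{s_1}^{s_2} \Big( \int g^2 \rho_s\, d\mathfrak m \Big)^{1/2} |\dot\rho_s|\, ds. \]

Now, as in [5, Thm. 3.15] we can apply the regularization property

\[ \mathsf P^{\mathcal E}_t : L^\infty(X,\mathfrak m) \to \mathcal A_{\mathcal E}, \qquad \|\Gamma(\mathsf P^{\mathcal E}_t f)\|_\infty \le c(K, t, \|f\|_\infty) \quad \forall f \in L^\infty(X,\mathfrak m) \]

derived from $\mathrm{BE}(K,\infty)$ with $\Gamma$-calculus techniques (see for instance [5, Cor. 2.3] for a proof) and the Feller property $\mathsf P^{\mathcal E}_t : C_b(X) \to C_b(X)$ (which implies, by monotone approximation, that the class of bounded $\tau$-upper semicontinuous functions is invariant under $\mathsf P^{\mathcal E}_t$) to get from the previous inequality with $\psi = \mathsf P^{\mathcal E}_t\phi$ the inequality

\[ \int \mathsf P^{\mathcal E}_t\phi\,\rho_{s_1}\, d\mathfrak m - \int \mathsf P^{\mathcal E}_t\phi\,\rho_{s_2}\, d\mathfrak m \le \int_{s_1}^{s_2} \Big( \int \big( c \wedge e^{-2Kt}\, \mathsf P^{\mathcal E}_t g^2 \big) \rho_s\, d\mathfrak m \Big)^{1/2} |\dot\rho_s|\, ds \]

with $c = c(K, t, \|\phi\|_\infty)$. Now, for all $\phi \in \mathcal A_{\mathcal E}$ we exploit once more the $\tau$-upper regularity assumption, using in the previous inequality functions $\phi_n \in \mathcal A_{\mathcal E}$ with $\|\phi_n\|_\infty \le \|\phi\|_\infty$ and bounded $\tau$-upper semicontinuous functions $g_n \ge \sqrt{\Gamma(\phi_n)}$ with $\phi_n \to \phi$ and $g_n \to \sqrt{\Gamma(\phi)}$ in $L^2(X,\mathfrak m)$, to get

\[ \int \mathsf P^{\mathcal E}_t\phi\,\rho_{s_1}\, d\mathfrak m - \int \mathsf P^{\mathcal E}_t\phi\,\rho_{s_2}\, d\mathfrak m \le \int_{s_1}^{s_2} \Big( \int \big( c \wedge e^{-2Kt}\, \mathsf P^{\mathcal E}_t \Gamma(\phi) \big) \rho_s\, d\mathfrak m \Big)^{1/2} |\dot\rho_s|\, ds. \]

Eventually we can take the limit as $t \downarrow 0$ to obtain (12.10).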
The proof of (d) in the case $K \ge 0$ is similar and uses the convexity properties of $\{\mu = \rho\mathfrak m : \|\rho\|_\infty \le c\}$ to avoid the regularization based on the Feller property $\mathsf P^{\mathcal E}_t : C_b(X) \to C_b(X)$. In the case $K > 0$ the result can be extended to $D(\mathrm{Ent})$ thanks to the $W_{\mathcal E}$-density of $\{\mu = \rho\mathfrak m : \rho \in L^\infty(X,\mathfrak m)\}$ in $D(\mathrm{Ent})$, ensured by Corollary 11.6. □


12.2. Extended metric measure spaces induce energy measure spaces. In view of the results of this section, it is useful to consider the case when $\mathrm{Ch}$ is a quadratic form, namely to assume that the parallelogram identity holds:

\[ \mathrm{Ch}(f + g) + \mathrm{Ch}(f - g) = 2\,\mathrm{Ch}(f) + 2\,\mathrm{Ch}(g) \quad \text{for all } f, g \in L^2(X,\mathfrak m). \tag{12.11} \]

Definition 12.6 (Asymptotically Hilbertian spaces). We say that an extended metric measure space $(X, \tau, d, \mathfrak m)$ is asymptotically Hilbertian if $\mathrm{Ch}$ satisfies the parallelogram identity (12.11).
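The parallelogram identity can be probed numerically on toy energies. The sketch below is only an illustration under simplifying assumptions: functions are replaced by lists of values along a finite path, `dirichlet_energy` is the sum of squared increments (a quadratic form), and `sup_energy` is a hypothetical 2-homogeneous but non-quadratic energy built from the largest increment. Neither is claimed to be the Cheeger energy of a specific space from the text.

```python
def dirichlet_energy(f):
    """Sum of squared increments of f along a path; a quadratic form."""
    return sum((b - a) ** 2 for a, b in zip(f, f[1:]))

def sup_energy(f):
    """Square of the largest increment of f; 2-homogeneous but not quadratic."""
    return max(abs(b - a) for a, b in zip(f, f[1:])) ** 2

def parallelogram_defect(energy, f, g):
    """E(f+g) + E(f-g) - 2E(f) - 2E(g); identically zero when E is a quadratic form."""
    plus = [a + b for a, b in zip(f, g)]
    minus = [a - b for a, b in zip(f, g)]
    return energy(plus) + energy(minus) - 2 * energy(f) - 2 * energy(g)

if __name__ == "__main__":
    f = [0.0, 1.0, 1.0]
    g = [0.0, 0.0, 1.0]
    print(parallelogram_defect(dirichlet_energy, f, g))  # quadratic energy
    print(parallelogram_defect(sup_energy, f, g))        # non-quadratic energy
```

A nonzero defect for a single pair $(f, g)$ is enough to rule out the Hilbertian property for the corresponding toy energy.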


In the proof of the next theorem we will also need the following calculus property, borrowed 
from [4]. 


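Lemma 12.7 (Plan representing the gradient, horizontal and vertical derivatives). Assume that $(X,d)$ is complete and let $u \in D(\mathrm{Ch}) \cap L^\infty(X,\mathfrak m)$. Then there exists a test plan $\boldsymbol\sigma \in \mathcal P(X^{[0,1]})$ representing the gradient of $u$ in the following sense:

\[ \lim_{t \downarrow 0} \frac{E_t}{t} = \lim_{t \downarrow 0} \frac{u \circ e_0 - u \circ e_t}{E_t} = |\mathrm D u|_w \circ e_0 \quad \text{in } L^2(X^{[0,1]}, \boldsymbol\sigma), \]

where $E_t(\eta) := \big( t \int_0^t |\dot\eta(s)|^2\, ds \big)^{1/2}$. Moreover, for any $f \in D(\mathrm{Ch})$ one has:

\[ \liminf_{t \downarrow 0} \int \frac{f(\eta(t)) - f(\eta(0))}{t}\, d\boldsymbol\sigma \ \ge\ \limsup_{\varepsilon \downarrow 0} \int \frac{|\mathrm D u|_w^2(\eta(0)) - |\mathrm D(u + \varepsilon f)|_w^2(\eta(0))}{2\varepsilon}\, d\boldsymbol\sigma. \tag{12.12} \]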


Proof. The first result is proven as in [4, Lem. 4.15] and relies on Proposition 5.5 applied to the semigroup $\mathsf P_t$ starting from $v := c\,e^u$ (with $c$ a suitable normalization constant), and then defining $\boldsymbol\sigma := (v^{-1} \circ e_0)\,\boldsymbol\eta$. The possibility to apply Proposition 5.5 to the gradient flow is ensured by (6.4) and Theorem 7.3. The second result is obtained as in [4, Lem. 4.5]. □


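Theorem 12.8. If $(X, \tau, d, \mathfrak m)$ is an asymptotically Hilbertian extended metric-topological measure space with $(X,d)$ complete, and if $\mathrm{Ch}$ denotes the associated Cheeger energy, defining $\mathcal E := \mathrm{Ch}$ one has that $(X, \mathcal E, \mathfrak m)$ is an energy measure space according to Definition 10.1.

In addition:

(a) $\Gamma(f) = |\mathrm D f|_w^2$ for any $f \in \mathbb V \cap L^\infty(X,\mathfrak m)$ and the formula

\[ \Gamma(f,g) := \lim_{\varepsilon \downarrow 0} \frac{|\mathrm D(f + \varepsilon g)|_w^2 - |\mathrm D f|_w^2}{2\varepsilon} \quad \text{in } L^1(X,\mathfrak m) \]

extends the $\Gamma$ operator from $\mathbb V \cap L^\infty(X,\mathfrak m)$ to the whole of $\mathbb V$.

(b) the extended distance

\[ d_{\mathcal E}(x,y) := \sup\{ |f(x) - f(y)| : f \in \mathbb V \cap C_b(X),\ \Gamma(f) \le 1 \} \tag{12.13} \]

satisfies $d_{\mathcal E} \ge d$ and, denoting by $\mathrm{Ch}_{d_{\mathcal E}}$ the Cheeger energy associated to the new metric-topological structure $(X, \tau, d_{\mathcal E}, \mathfrak m)$, one has $\mathrm{Ch}_{d_{\mathcal E}} = \mathcal E$.

(c) $d_{\mathcal E} = d$ if and only if $f \in \mathbb V \cap C_b(X)$ and $|\mathrm D f|_w \le 1$ $\mathfrak m$-a.e. in $X$ implies $f \in \mathrm{Lip}_b(X, \tau, d)$ with $\mathrm{Lip}(f,d) \le 1$.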

Proof. By the locality and chain rule properties of $f \mapsto |\mathrm D f|_w$ stated in Proposition 6.3(c,e), the asymptotically Hilbertian assumption ensures that $\mathcal E := \mathrm{Ch}$ is a strongly local and Markovian Dirichlet form in $L^2(X,\mathfrak m)$.

The proof of statement (a) can be obtained as in [4, Sec. 4], see in particular [4, Thm. 4.18]. For convenience we briefly sketch the proof. Let us set for $f, g \in \mathbb V$

\[ G(f,g) := \lim_{\varepsilon \downarrow 0} \frac{|\mathrm D(f + \varepsilon g)|_w^2 - |\mathrm D f|_w^2}{2\varepsilon} \quad \text{in } L^1(X,\mathfrak m). \]

Note that the limit above exists as a monotone limit by convexity of the minimal weak upper gradient, and that $\mathcal E(f,g) = \int G(f,g)\, d\mathfrak m$. One first establishes the following chain rule for $G$: for all $f, g \in \mathbb V$ and $\phi : \mathbb R \to \mathbb R$ non-decreasing, Lipschitz and $C^1$, one has

\[ G(f, \phi(g)) = \phi'(g)\, G(f,g), \qquad \int G(\phi(g), f)\, d\mathfrak m = \int \phi'(g)\, G(g,f)\, d\mathfrak m. \tag{12.14} \]

This is proven arguing as in [4, Lem. 4.7]. Moreover one follows [4, Prop. 4.17] using Lemma 12.7 to establish the Leibniz rule

\[ \mathcal E(f, gh) = \int h\, G(f,g) + g\, G(f,h)\, d\mathfrak m \tag{12.15} \]

for all $f, g, h \in \mathbb V \cap L^\infty(X,\mathfrak m)$ with $g, h \ge 0$.

To prove the claim it is sufficient to show that $G$ is bilinear and symmetric and therefore in turn it is sufficient to prove that $f \mapsto \int h\, |\mathrm D f|_w^2\, d\mathfrak m$ is quadratic for every nonnegative bounded Borel $h$ or, by approximation, for $h \in \mathbb V \cap L^\infty(X,\mathfrak m)$. Now use (12.14), (12.15) to write

\[ \int h\, |\mathrm D f|_w^2\, d\mathfrak m = \int h\, G(f,f)\, d\mathfrak m = -\mathcal E\Big(\frac{f^2}{2}, h\Big) + \mathcal E(f, fh). \]

We conclude by quadraticity of the terms on the right hand side.

The proof of statement (b) is easy, since all functions $f \in \mathrm{Lip}_b(X, \tau, d)$ belong to $C_b(X)$ and $\sqrt{\Gamma(f)} = |\mathrm D f|_w \le \mathrm{Lip}(f)$ $\mathfrak m$-a.e. in $X$. It follows that all the distances $d_i$ approximating $d$ from below are admissible in (12.13), so that $d_{\mathcal E} \ge d$. Since Theorem 9.2 ensures that $\mathcal E = \mathrm{Ch}$ is $\tau$-upper regular, by Theorem 12.5 we obtain $\mathrm{Ch}_{d_{\mathcal E}} = \mathcal E$.

In order to prove statement (c), notice that the inclusion of $\{ f \in C_b(X) \cap \mathbb V : |\mathrm D f|_w \le 1 \}$ in the class of 1-Lipschitz functions w.r.t. $d$ implies, by the very definition of $d_{\mathcal E}$, that $d_{\mathcal E} \le d$. The converse is obvious, again by the definition of $d_{\mathcal E}$. □


13. Examples 

Here we collect natural examples of energy measure spaces and extended metric measure 
spaces. 


13.1. Degenerate Dirichlet forms. Consider $X = \mathbb R^2$ equipped with the usual topology $\tau$, the Borel $\sigma$-algebra $\mathcal B(\tau)$, and the standard Gaussian measure $\gamma(dx) = (2\pi)^{-1} e^{-|x|^2/2}\, dx$. Consider further a Dirichlet form measuring energy only in the first coordinate, i.e.

\[ \mathcal E(f) = \int |\partial_1 f(x_1,x_2)|^2\, d\gamma(x_1,x_2) \]

for $f \in L^2(\mathbb R^2,\gamma)$ with $f(\cdot,x_2) \in W^{1,2}(\mathbb R)$ for a.e. $x_2 \in \mathbb R$. Then $(\mathbb R^2, \mathcal B(\tau), \mathcal E, \gamma)$ is an energy measure space according to Definition 10.1.

As the distance generated from the algebra according to (12.1d), (12.1e) one obtains

\[ d_{\mathcal E}\big((x_1,x_2),(y_1,y_2)\big) = \begin{cases} |x_1 - y_1| & \text{if } x_2 = y_2, \\ +\infty & \text{else} \end{cases} \]

and $(X, \tau, d_{\mathcal E}, \gamma)$ is an extended metric measure space according to Definition 4.7.

Note that the Bakry-Emery condition $\mathrm{BE}(1,\infty)$ is satisfied for this Dirichlet form, but $\mathcal E$ is not irreducible (see (10.17)); in fact, it is easy to construct measures with bounded densities but infinite $W_{\mathcal E}$ distance from the reference measure $\gamma$. Using a standard approximation by restriction, one can check that $\mathcal E$ is $\tau$-upper regular. It follows that $\mathcal E$ coincides with the Cheeger energy induced by $d_{\mathcal E}$, by Theorem 12.5.
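The extended distance of this example can be written down directly. The sketch below is illustrative only: `degenerate_distance` encodes the formula for the distance stated above, and `lower_bound` evaluates the supremum of $|f(x) - f(y)|$ over a small hypothetical family of bounded test functions with $|\partial_1 f| \le 1$ (a truncation of $x_1$ plus $\pm n \tanh(x_2)$). It shows how functions of $x_2$ alone, which carry no energy, force the distance between different horizontal lines to blow up.

```python
import math

def degenerate_distance(x, y):
    """Extended distance on R^2 when energy is measured only in the first coordinate."""
    return abs(x[0] - y[0]) if x[1] == y[1] else math.inf

def lower_bound(x, y, n, cutoff=10.0):
    """Largest |f(x) - f(y)| over f(z) = clip(z1, -cutoff, cutoff) + s * n * tanh(z2), s = +1 or -1.

    Each such f is bounded and continuous with |d/dz1 f| <= 1 almost everywhere,
    so (as an assumption of this sketch) it is treated as an admissible test function.
    """
    def clip(v):
        return max(-cutoff, min(cutoff, v))
    best = 0.0
    for s in (1.0, -1.0):
        def f(z):
            return clip(z[0]) + s * n * math.tanh(z[1])
        best = max(best, abs(f(x) - f(y)))
    return best

if __name__ == "__main__":
    print(degenerate_distance((0.0, 1.0), (3.0, 1.0)))    # same horizontal line
    print(degenerate_distance((0.0, 1.0), (0.0, 1.5)))    # different lines
    for n in (1, 10, 100):
        print(n, lower_bound((0.0, 1.0), (0.0, 1.5), n))  # grows linearly in n
```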
13.2. Abstract Wiener spaces. Consider a separable Banach space $X$ (or, more generally, a locally convex topological space) and a centered, non-degenerate Gaussian measure $\gamma$ in $X$. The Cameron-Martin space $\mathcal H \subset X$ is the image under the mapping

\[ Rf := \int f(x)\, x\, d\gamma(x), \qquad f \in L^2(X,\gamma), \]

of the so-called reproducing kernel of $\gamma$, namely the closure $H$ in $L^2(X,\gamma)$ of $\{ \langle x', \cdot \rangle \}_{x' \in X'}$ (here $X'$ is the topological dual of $X$ and $\langle \cdot,\cdot \rangle$ is the canonical pairing). It is easily seen that $R : H \to \mathcal H$ is injective, hence $\mathcal H$ inherits from $L^2(X,\gamma)$ a Hilbert structure. The structure $(X, \gamma, \mathcal H)$ is called abstract Wiener space.

Denote by $\mathcal{FC}_b^\infty$ the set of smooth, bounded cylinder functions, i.e. the functions $f$ of the form $f(x) = \phi(\langle x_1', x \rangle, \dots, \langle x_n', x \rangle)$, for $x_1', \dots, x_n' \in X'$ and $\phi : \mathbb R^n \to \mathbb R$ smooth and bounded. For such a cylinder function we define its gradient via

\[ \nabla_{\mathcal H} f(x) = \sum_{i=1}^n \partial_i \phi(\langle x_1', x \rangle, \dots, \langle x_n', x \rangle)\, R\langle x_i', \cdot \rangle. \]

It is well-known that the quadratic form

\[ \mathcal E(f) = \int |\nabla_{\mathcal H} f|_{\mathcal H}^2\, d\gamma, \qquad f \in \mathcal{FC}_b^\infty, \]

is closable, that $\mathcal{FC}_b^\infty$ is dense in $L^2(X,\gamma)$ and that $\mathcal E$ admits a carré du champ operator, see [15]. Thus $(X, \mathcal B(\tau), \mathcal E, \gamma)$, with $\tau$ being the weak or strong topology of the Banach space $X$, is an energy measure space according to Definition 10.1.

The associated semigroup $\mathsf P$ is given by Mehler’s formula

\[ \mathsf P_t f(x) = \int f\big( e^{-t} x + \sqrt{1 - e^{-2t}}\, y \big)\, d\gamma(y). \]

From this one can check that the Bakry-Emery condition $\mathrm{BE}(1,\infty)$ holds. Using a Rademacher type theorem for abstract Wiener spaces, see [20], one can check that the induced distance according to (12.1d), (12.1e) is the Cameron-Martin distance

\[ d_{\mathcal E}(x,y) = \begin{cases} |x - y|_{\mathcal H} & \text{if } x - y \in \mathcal H, \\ +\infty & \text{else.} \end{cases} \]

Thus $(X, \tau, d_{\mathcal E}, \gamma)$ is an extended metric measure space according to Definition 4.7.
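Mehler's formula and the gradient commutation behind $\mathrm{BE}(1,\infty)$ can be checked by hand in the one-dimensional model $X = \mathbb R$ with $\gamma$ the standard Gaussian, which is an assumption of this sketch. The code below evaluates $\mathsf P_t f$ with a 3-point Gauss-Hermite rule (exact for polynomial integrands of degree at most 5, hence exact up to rounding for the test functions used) and compares $\partial_x \mathsf P_t f$ with $e^{-t}\, \mathsf P_t(f')$. The names `mehler`, `derivative`, `NODES` and `WEIGHTS` are hypothetical.

```python
import math

# 3-point Gauss-Hermite rule for the standard Gaussian on R:
# exact for polynomial integrands of degree <= 5.
NODES = (-math.sqrt(3.0), 0.0, math.sqrt(3.0))
WEIGHTS = (1.0 / 6.0, 2.0 / 3.0, 1.0 / 6.0)

def mehler(f, t, x):
    """P_t f(x) = int f(e^{-t} x + sqrt(1 - e^{-2t}) y) d gamma(y), by quadrature."""
    a = math.exp(-t)
    b = math.sqrt(1.0 - math.exp(-2.0 * t))
    return sum(w * f(a * x + b * y) for y, w in zip(NODES, WEIGHTS))

def derivative(g, x, h=1e-4):
    """Central finite difference approximation of g'(x)."""
    return (g(x + h) - g(x - h)) / (2.0 * h)

if __name__ == "__main__":
    t, x = 0.7, 1.3
    f = lambda z: z ** 3
    df = lambda z: 3.0 * z ** 2
    lhs = derivative(lambda z: mehler(f, t, z), x)   # d/dx P_t f
    rhs = math.exp(-t) * mehler(df, t, x)            # e^{-t} P_t f'
    print(lhs, rhs)
    # Gradient estimate: |d/dx P_t f|^2 <= e^{-2t} P_t(|f'|^2)
    print(rhs ** 2 <= math.exp(-2.0 * t) * mehler(lambda z: df(z) ** 2, t, x))
```

The commutation $\partial_x \mathsf P_t f = e^{-t}\, \mathsf P_t(f')$ combined with Jensen's inequality is what yields the gradient estimate with constant $K = 1$ in this model.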


13.3. Configuration spaces. Configuration spaces appear naturally as the state space for systems of infinitely many indistinguishable diffusing particles. Let $M$ be a Riemannian manifold with metric tensor $\langle \cdot,\cdot \rangle$. The configuration space $\Upsilon$ over $M$ is the set of all locally finite counting measures, i.e.

\[ \Upsilon = \{ \gamma \in \mathcal M(M) : \gamma(K) \in \mathbb N_0 \ \ \forall K \subset M \text{ compact} \}. \]

The space $\Upsilon$ is equipped with the vague topology, denoted by $\tau$, in duality with continuous and compactly supported functions.

A natural differentiable and energy structure on the configuration space has been introduced in [1], by lifting the geometry on $M$, as we shall briefly describe. The tangent space $T_\gamma\Upsilon$ consists of all $\gamma$-square integrable vector fields on $M$ and is equipped with the inner product

\[ \langle V_1, V_2 \rangle_\gamma = \int \langle V_1(x), V_2(x) \rangle_x\, d\gamma(x). \]

Let $\mathrm{Cyl}_b^\infty$ be the set of smooth and bounded cylinder functions, i.e. functions $F : \Upsilon \to \mathbb R$ of the form $F(\gamma) = g(\gamma(\phi_1), \dots, \gamma(\phi_n))$ where $g \in C_b^\infty(\mathbb R^n)$ and $\phi_1, \dots, \phi_n \in C_c^\infty(M)$ and we write $\gamma(\phi) = \int \phi\, d\gamma$. Given $F \in \mathrm{Cyl}_b^\infty$ we define its gradient at $\gamma$ as the vector field on $M$ given by

\[ \nabla^\Upsilon F(\gamma) = \sum_{i=1}^n \partial_i g(\gamma(\phi_1), \dots, \gamma(\phi_n))\, \nabla\phi_i. \]
i= 1 ~ Jl 
Similarly, for a cylindrical “vector field” on $\Upsilon$ of the form $W = \sum_i F_i V_i$ with $F_i \in \mathrm{Cyl}_b^\infty$ and $V_i$ smooth, compactly supported vector fields on $M$, we define its divergence as

\[ \mathrm{div}^\Upsilon(W)(\gamma) = \sum_i \langle \nabla^\Upsilon F_i, V_i \rangle_\gamma + F_i(\gamma)\, \gamma(\mathrm{div}\, V_i). \]

The natural reference measure on $\Upsilon$ is the Poisson measure $\pi$, that can be characterized by its Laplace transform, i.e. for all $f \in C_c(M)$

\[ \int \exp(\gamma(f))\, d\pi(\gamma) = \exp\Big( \int_M \big( \exp(f(x)) - 1 \big)\, d\mathrm{vol}(x) \Big). \]

This is (up to the intensity) the unique probability measure such that the gradient and the divergence are adjoint in $L^2(\Upsilon,\pi)$, see [1, Thm. 3.2]. The quadratic form

\[ \mathcal E(F,F) = \int \langle \nabla^\Upsilon F, \nabla^\Upsilon F \rangle_\gamma\, d\pi(\gamma) \]

defined on $\mathrm{Cyl}_b^\infty$ is closable to a Dirichlet form admitting a carré du champ operator, see [1, Cor. 1.4], [38, Prop. 1.4], so that $(\Upsilon, \mathcal B(\tau), \mathcal E, \pi)$ is an energy measure space according to Definition 10.1. The associated semigroup is given by the evolution of infinitely many independent Brownian particles on $M$.

The induced distance according to (12.1d), (12.1e) is known to be the $L^2$-transport distance between (non-normalized) configurations (see [38, Thm. 1.5]), i.e.

\[ d_{\mathcal E}^2(\gamma,\eta) = \inf_q \int d^2(x,y)\, dq(x,y) \]

where $d$ is the Riemannian distance and the infimum is over all couplings $q$ of $\gamma$ and $\eta$. $(\Upsilon, \tau, d_{\mathcal E}, \pi)$ is now an extended metric measure space according to Definition 4.7.
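For finite configurations the transport distance above reduces to an optimal matching problem. The following minimal sketch assumes $M = \mathbb R$ with the Euclidean distance and finite configurations given as lists of points (with multiplicity); couplings of two counting measures then exist only when the numbers of points agree, and since the cost is linear in the coupling an optimal one can be taken to be a matching. The function name `configuration_distance` is hypothetical, and the brute-force search over permutations is only meant for very small examples.

```python
import math
from itertools import permutations

def configuration_distance(gamma, eta):
    """L^2-transport distance between two finite configurations on the real line."""
    if len(gamma) != len(eta):
        return math.inf  # no coupling between counting measures of different total mass
    best = min(
        sum((x - y) ** 2 for x, y in zip(gamma, perm))
        for perm in permutations(eta)
    )
    return math.sqrt(best)

if __name__ == "__main__":
    print(configuration_distance([0.0, 1.0], [1.0, 2.0]))  # both points shift by 1
    print(configuration_distance([0.0, 1.0], [1.0, 0.0]))  # same configuration, relabeled
    print(configuration_distance([0.0], [0.0, 5.0]))       # different number of points
```

The second call illustrates that the distance does not see the labeling of the particles, in line with their being indistinguishable.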

It is shown in [21, Prop. 2.3] that $\mathcal E$ coincides with the Cheeger energy constructed from $d_{\mathcal E}$ (as defined in [3] based on Lipschitz constants, but similar arguments apply to the construction used here based on asymptotic Lipschitz constants).

A detailed study of curvature bounds for configuration spaces can be found in [21]. It has been shown that various notions of curvature bounds lift from the base space $M$ to the configuration space. In particular, if the Ricci curvature of $M$ is bounded below by $K$, an Evolution Variational Inequality and the Bakry-Emery gradient estimate with constant $K$ for the semigroup hold on the configuration space.